RA antigenic peptides

The present invention provides novel naturally-processed RA antigenic peptides 
which are candidate markers for erosive and non-erosive RA. These antigenic peptides are 
presented by human MHC class II HLA-DR molecules. Moreover, these antigenic peptides 
linked to MHC class II molecules, as well as antibodies reactive with said antigenic
peptides, nucleic acids encoding said antigenic peptides, and nucleic acid constructs and 
host cells for expressing said antigenic peptides are provided. The antigenic peptides of the 
invention as well as the polypeptides they are derived from can be used as markers in 
diagnosis of RA and in therapy as anti-RA vaccines. 

Rheumatoid Arthritis (RA), originally termed chronic polyarthritis, is a systemic
autoimmune disease and one of the most debilitating forms of articular inflammation
(Feldmann, M. et al., Cell 85 (1996) 307-310; Dedhia, H.V. & DiBartolomeo, A., Critical
care clinics 18 (2002) 841-854). Typically, RA causes joint pain, deformities and severe
joint stiffness. The disease can also manifest outside the joints, especially in
patients who are positive for an autoantibody termed "rheumatoid factor" (RF) (Mageed,
R.A., in: van Venrooij, W.J. & Maini, R.N. eds., Manual of biological markers of disease,
Kluwer Academic Publishers (1996) 1-18). RA occurs quite frequently in the Caucasian
population, with the susceptibility to RA being influenced by genetic and environmental
factors. Both have a crucial effect on the onset and the progression of this autoimmune
disease. Approximately 4% of the total population has an increased genetic susceptibility to
RA, roughly 20% of which (around 1% of the total population) develops RA as a result of
as yet uncharacterized non-inheritable factors. Beyond that, RA shows a significant bias in
the sex ratio: women have a threefold higher risk for RA than men, indicating that sex
hormones may also be involved in the pathogenesis.

In the beginning, RA progresses slowly. Typical early stage symptoms are
palm sweating, morning stiffness of fingers and symmetrical joint inflammation
(www.medicine-worldwide.de). In addition, rheumatoid nodules can appear, which is an
indication of tissue involvement outside the joints. In a simplified model, the immune system
produces autoantibodies against healthy tissue (www.netdoktor.de). These autoantibodies
attack the articular cartilage in the joint, leading to its inflammation and later on to its
destruction. This destruction stimulates the immune system to produce more
autoantibodies. In addition, cytokines like tumor necrosis factor alpha (TNF-α) and
Interleukin-1 (IL-1) are produced, which enhance the inflammatory reaction even further
(Houssiau, F.A., Clin Rheumatol 14 Suppl 2 (1995) 10-13). The synovium begins to swell
due to infiltration of additional cells of the immune system, such as macrophages and
T cells. These cells are actively involved in causing further cell death and in driving joint
inflammation (Fox, D.A., Arthritis Rheum 40 (1997) 598-609; Choy, E.H. & Panayi, G.S.,
N Engl J Med 344 (2001) 907-916). This process resembles a vicious circle of autoantibody
production, joint inflammation and joint destruction.

Typically, RA progresses chronically, with 85-90% of all RA patients showing a mild
to moderate disease development. Aggressive disease forms leading to complete loss of
joint function, up to the degree of disability, are experienced by 10-15% of the patients. In
this advanced RA state, patients have a permanent articular inflammation and display
rheumatoid nodules. They suffer from strong chronic pain, and the inflammation leads to
severe finger stiffness and irreversible joint deformations or dislocations.

Diagnosis 

There is growing evidence that therapeutic intervention early in the disease can
reduce the extent of joint damage (Egsmose, C. et al., J Rheumatol 22 (1995) 2208-2213;
Van der Heide, A. et al., Ann Intern Med 124 (1996) 699-707). Since treatment with
disease-modifying antirheumatic drugs (DMARDs) is only justified when the risk-benefit
or cost-effectiveness ratios are favorable, it is mandatory to be able to differentiate between
RA and other forms of arthritis shortly after onset of the disease (Kirwan, J.R. & Quilty, B.,
Clin Exp Rheumatol 15 (1997) 15-25). The diagnosis is made by established criteria based
on clinical history, physical examination and laboratory tests. The American Society of
Rheumatism published a catalog of criteria to help gain objective evidence for RA
(Arnett, F.C. et al., Arthritis Rheum 31 (1987) 315-324). But so far, not a single test is
available which is specific for RA. Several biological and biochemical markers, e.g.
C-reactive protein (CRP), erythrocyte sedimentation rate (ESR), antinuclear antibody (ANA)
or RF, are utilized for the evaluation of RA. However, these markers are non-specific, as
they appear in other inflammatory or autoimmune diseases as well. The RF, for instance, is
an autoantibody that is present in the serum of approximately 50% of RA patients. Since
increased levels of the same autoantibody can also be found in the context of other
inflammatory diseases, such as Sjögren syndrome, endocarditis or chronic hepatitis, RF
is unsuitable to serve as a diagnostic marker for RA. Rather than being of diagnostic value
per se, the above-mentioned biochemical and biological markers are useful for assessing
disease activity and prognosis as well as in the treatment and management of RA patients
(Nakamura, R.M., J Clin Lab Anal 14 (2000) 305-313).

Recently, a diagnostic set of criteria was developed that consists of clinical and
biochemical aspects which were claimed to discriminate, at an early stage, between
self-limiting, persistent non-erosive, and persistent erosive RA (Visser, H. et al., Arthritis
Rheum 46 (2002) 357-365). Self-limiting arthritis was characterized by natural remission:
there was no arthritis on examination in a patient for a certain period of time. Erosive
arthritis was defined based on the presence of erosions on radiographs of the hands and/or
feet. In particular, the use of antibodies recognizing cyclic citrullinated peptides appears to
be promising and suggests an important role for citrullinated antigens in the early
diagnosis and prognosis of erosive RA (Schellekens, G.A. et al., J Clin Invest 101 (1998)
273-281; Vincent, C. et al., J Rheumatol 25 (1998) 838-846). The early recognition of
erosive RA allows early intervention with DMARDs, which will lead to earlier disease
control and improvement of disease outcome (Symmons, D.P.M. et al., J Rheumatol 25
(1998) 1072-1077; Anderson, J.J. et al., Arthritis Rheum 43 (2000) 22-29). Likewise, early
recognition of self-limiting and non-erosive arthritis will prevent unnecessary treatment
with potentially toxic therapeutics (Fries, J.F. et al., Arthritis Rheum 36 (1993) 297-306).

Therapy 

The goal of any anti-rheumatic therapy is to relieve pain in order to ease the activities
of everyday life. So far, complete healing of RA is not possible, but by applying modern
therapies the progression of the disease can be slowed down or even stopped. Due to
individual differences, each patient requires an individualized therapy, and early diagnosis,
as mentioned before, is desirable. RA therapy is complex and includes lifelong medicinal
treatment as well as physio- and radiotherapy. DMARDs used in RA therapy are basic
therapeutics (e.g. Methotrexate, Sulfasalazine, Hydroxychloroquine, Leflunomide,
Azathioprine), cortisone, non-steroidal anti-inflammatory drugs (NSAID) or monoclonal
antibodies against the pro-inflammatory cytokines TNF-α, IL-1 or their respective
receptors (http://rheuma-online.de). What these drugs have in common is that they
inhibit inflammation by suppressing the immune response. Their main disadvantages are
their lack of specificity for RA, their adverse effects and their inability to effectively target
the causes of RA.

Autoimmunity

Autoimmunity starts when a specific adaptive immune response is initiated against
self antigens (autoantigens), manifested by the development of self-reactive T or B cells.
The normal consequence of an adaptive immune response against a foreign antigen is the
clearance of the antigen from the body. When an adaptive immune response develops
against a self antigen, however, the antigen can in most cases not be completely removed
from the body, leading to a sustained immune response. As a consequence, the effector
mechanisms of immunity cause chronic inflammatory injury to tissues. The mechanisms
of tissue damage are essentially the same in autoimmune disease as those that operate in
protective immunity and in hypersensitivity. Even though it is not well understood what
triggers autoimmunity, several events which are nowadays believed to contribute to the
induction of autoimmune diseases and selection of autoantigenic targets have been
summarized most recently (Marrack, P. et al., Nat Med 7 (2001) 899-905).

Autoimmune diseases are controlled by properties of particular genes of each
individual and by environmental factors. The host's genes affect the susceptibility to
autoimmunity at least at three levels. First, some of the genes affect the overall reactivity of
the immune system and, thus, can predispose the individual to certain or to several
different types of autoimmune diseases. Second, this altered immunoreactivity is funneled
to particular autoantigens and tissues by other genes that affect recognition of antigenic
peptides by T cells. Third, still other genes act on the ability of target tissues to modulate
immune attack, for instance by influencing the activity of effector cells of the immune
system which are destined to initiate an autoaggressive attack. The latter two sets of genes
dictate which antigens will be the targets of autoimmunity and hence which organs will be
attacked and what damage will occur.

In addition, signals from the environment influence the development of
autoimmunity at the same three levels, by affecting the overall reactivity of the immune
system, the antigen-specificity and the state of the potential target tissue. And finally, there
is cross-talk between genetic and environmental factors.

Major histocompatibility complex (MHC) 

Population studies, genotyping and modern approaches at the molecular level have
unanimously shown that certain genes encoded by the major histocompatibility complex
(MHC) confer a significantly higher risk for the development of RA (Stastny, P., Tissue
Antigens 4 (1974) 571-579; Wordsworth, P. et al., PNAS 86 (1989) 10049-10053;
Wordsworth, P. & Bell, J., Springer Semin Immunopathol 14 (1992) 59-78). In particular,
the class II MHC alleles HLA-DRB1*0101, *0401, *0404 and *0405 in several ethnic groups
increase the susceptibility to RA (Reveille, J., Curr Opin Rheumatol 10 (1998) 187-200).
For example, more than 90% of RF-positive RA patients carry one of these susceptibility
alleles. HLA class II molecules are MHC surface proteins that bind antigenic peptides
within the cell and present them on the surface of antigen-presenting cells for interaction
with the T cell receptors of CD4+ helper T lymphocytes, thereby initiating a cellular
immune response (Banchereau, J. & Steinman, R.M., Nature 392 (1998) 245-252). The
RA-association of particular HLA class II molecules together with the presence of large
numbers of activated CD4+ T cells in synovial tissue has supported the model of disease
induction in which disease-associated HLA-DR molecules present disease-relevant (e.g.
synovial) autoantigens and cause stimulation and expansion of synovial T cells, which then
drive the inflammatory process (Striebich, C.C. et al., J Immunol 161 (1998) 4428-4436).

MHC class II HLA-DR (short: DR) proteins are heterodimers consisting of
monomorphic α- and extremely polymorphic β-chains that bind peptide antigens in a
peptide binding groove. This groove generally has four major pockets to accept side chains
at relative positions 1, 4, 6 and 9 of the peptide (Stern, L.J. et al., Nature 368 (1994)
215-221). The allelic variations between HLA class II molecules account for the differential
ability to bind antigenic peptides. This is why individuals differing in their
HLA alleles have divergent antigenic peptide repertoires, thereby leading to differences in
the quality of immune responses (Messaoudi, I. et al., Science 298 (2002) 1797-1800).

Peptides bound by class II MHC molecules are typically longer and more
heterogeneous in size (11-25 amino acids) than the peptides bound by class I MHC
molecules (8-10 amino acids). This difference arises because the peptide binding groove of
class II proteins is open and, while peptides are gripped in the middle, their ends can extend
out of the groove in a variable fashion (Jones, E.Y., Curr Opin Immunol 9 (1997) 75-79).
As a consequence, class II molecules typically bind sets of overlapping peptides that share a
common core sequence, termed "T cell epitope", but have different lengths.
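
The idea of a nested set of overlapping peptides sharing one common stretch can be illustrated with a short sketch. It is a minimal illustration only: the function name shared_core and the three peptide sequences are hypothetical and are not taken from the sequence listing, and the longest common substring it returns may be longer than the actual T cell epitope.

```python
def shared_core(peptides):
    """Return the longest substring common to all peptides (first found if tied)."""
    if not peptides:
        return ""
    shortest = min(peptides, key=len)
    # Try candidate substrings of the shortest peptide, longest first.
    for length in range(len(shortest), 0, -1):
        for start in range(len(shortest) - length + 1):
            candidate = shortest[start:start + length]
            if all(candidate in p for p in peptides):
                return candidate
    return ""


# Hypothetical nested set: same central stretch, different flanking lengths.
nested_set = [
    "GSAKLVFAEMDTRSG",
    "SAKLVFAEMDTRS",
    "AKLVFAEMDTRSGQ",
]
print(shared_core(nested_set))  # AKLVFAEMDTRS
```

The brute-force search is adequate for peptides of 11-25 residues; it only locates the region in which a common core would have to lie.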

More than a decade ago, it was recognized that the DRβ chains encoded by
RA-linked DRB1 alleles, although exhibiting polymorphic differences, all share a stretch of
identical or almost identical amino acids at positions 67-74, known as the "shared epitope"
(Gregersen, P.K. et al., Arthritis Rheum 30 (1987) 1205-1213). Since immunity to
autoantigens has been regarded as central to the pathogenesis of RA, it was hypothesized that
the shared epitope could impose disease linkage on the respective DR molecules by at least
two different mechanisms: first, by selecting the relevant autoantigenic peptides for
presentation, and second, by selecting the appropriate autoreactive T cell specificities
during ontogeny. The three-dimensional structure of DR molecules has indeed revealed
that the shared epitope is located in the center of the α-helix flanking one side of the
peptide binding groove (Stern, L.J. et al., Nature 368 (1994) 215-221). Thus,
this shared epitope region is strategically positioned in such a way that it can interact
with both bound peptide and T cell receptor.

However, one of the unresolved mysteries in rheumatology research is the question of
what the key arthritogenic antigens and epitopes in man are that trigger the onset and the
development of RA. Although autoantibodies of different specificity have been identified
in serum and synovial fluid of patients, it is often unclear whether the antigens which were
released at the time of cartilage degradation were initiating pathogenicity or whether they
are merely a consequence of antigen spreading as a result of inflammation (Corrigall, V.M.
& Panayi, G.S., Crit Rev Immunol 22 (2002) 281-293). Furthermore, it is difficult to define
pathogenic mechanisms in which the antigen is present throughout the body, including the
joint, but the pathology is targeted solely or predominantly to the joint.

Autoantigens 

The large number of possible autoantigens in RA is derived from studies using sera
or, less frequently, T cells from patients with established chronic RA. One of the most
convincing joint-specific antigens that has been proposed in the context of DR molecules is
type II collagen (CII), the predominant protein in articular cartilage. Autoantibodies
against CII were found in elevated concentrations in the serum and joints of RA patients,
although it is not yet clear whether anti-CII antibodies are pathogenic in RA (Banerjee, S.
et al., Clin Exp Rheumatol 6 (373-380). Snowden and coworkers have shown that
peripheral blood T cells from RA patients proliferated in response to CII, most pronouncedly
in those patients with anti-CII antibodies. However, the response was seen only in 50% of
patients (Snowden, N. et al., Rheumatology 40 (1997) 1210-1218). In a mouse model,
immunization with CII was shown to induce arthritis in mice expressing the class II MHC
alleles DRB1*0401 and *0101 (Rosloniec, E.F. et al., J Exp Med 185 (1997) 1113-1122;
Rosloniec, E.F. et al., J Immunol 160 (1998) 2573-2578). The immunodominant epitope in
both *0401 and *0101 transgenic mice was localized to peptides within residues 261-273 of
human CII (Fugger, L. et al., Eur J Immunol 26 (1996) 928-933). The same epitope of CII
was capable of stimulating a T cell response in RA patients, particularly in the early stages
of disease. Synovial fluid T cells were especially responsive (Kim, H.Y. et al., Arthritis
Rheum 42 (1999) 2085-2093).

Although other cartilage proteins have been proposed as RA candidate antigens,
DR4-binding epitopes have been defined only for human cartilage glycoprotein 39
(HCgp39). This protein is secreted by synovial cells and articular chondrocytes and its
expression is upregulated in plasma and joints during inflammation (Vos, K. et al., Ann
Rheum Dis 59 (2000) 544-548). Similar to CII, HCgp39 treatment induces arthritis in
mice. In addition, an HCgp39 response of peripheral blood T cells from RA patients
was detected (Verheijden, G.F. et al., Arthritis Rheum 40 (1997) 1115-1125). The
predominant epitope recognized by T cells in DR4 patients was defined between residues
263-275 and is identical to the immunodominant epitope found in DRB1*0401-transgenic
mice after immunization with native HCgp39 (Cope, A.P. et al., Arthritis Rheum 42 (1999)
1497-1507). Although not disease specific, responses to this peptide did correlate with
disease activity in RA patients (Baeten, D. et al., Arthritis Rheum 43 (2000) 1233-1243).
Antibodies to HCgp39, however, have also been detected in the sera of patients with
inflammatory diseases, such as inflammatory bowel disease and systemic lupus
erythematosus (SLE), albeit at a lower level than in RA.

In an attempt to track antigen-specific T cells in RA, soluble peptide-DR4 tetrameric
complexes were used to detect synovial CD4+ T cells reactive with CII or HCgp39 in DR4+
patients (Kotzin, B.L. et al., PNAS 97 (2000) 291-296). The CII-DR4 complex bound in a
specific manner to CII peptide-reactive T cell hybridomas, but did not stain a detectable
fraction of synovial CD4+ cells. Very similar results were obtained with the
HCgp39-DR4 complex, suggesting that the major oligoclonal CD4+ T cell expansions present
in RA joints are not specific for the dominant CII and HCgp39 determinants described above.

In summary, despite some strong indications for a CII and HCgp39 association with
RA, the evidence that they are important antigens in RA is scant. A direct proof that
peptides of CII or HCgp39 are presented in a class II MHC-restricted manner by
antigen-presenting cells with subsequent stimulation and activation of synovial CD4+ T cells
is still lacking. Furthermore, a major problem of animal models is their unknown relevance
to RA, as CII-induced arthritis in immunized rats or mice differs in many respects from RA.

Naturally processed MHC class II-associated peptides

An alternative strategy to the identification of RA-specific autoantibodies and T cells
relies on the sequence analysis of naturally processed peptide antigens bound to MHC class
II molecules. With the help of monoclonal antibodies, class II MHC molecules conferring
susceptibility to RA can be purified from cognate cells. RA-associated peptide antigens can
be acid-eluted from purified HLA class II molecules. The mixture of small peptides can be
separated by HPLC and the peptide sequences determined by Edman sequencing or mass
spectrometry. Due to limitations with peptide purification and sequencing techniques,
peptide sequences have, as yet, only been obtained from MHC molecules that were isolated
from cultured B cell lines or large amounts of tissue, and the analysis was restricted to a
few abundant peptides (Kropshofer et al., J Exp Med 175 (1992) 1799-1803; Chicz, R.M. et
al., J Exp Med 178 (1993) 27-47). As a result of the development of high-resolution
microcapillary HPLC columns and more sensitive mass spectrometers, MHC-bound
peptides can be analyzed more efficiently (Dongre, A.R. et al., Eur J Immunol 31
(2001) 1485-1494; Engelhard, V.H. et al., Mol Immunol 39 (2002) 127-137).

In the present invention, a modified peptide isolation and sequencing technique was
used to investigate the peptide antigen repertoire of HLA-DR4 molecules derived from
autologous dendritic cells (DCs) which were pulsed with serum or synovial fluid derived
from RA patients. The main advantage of this approach is the use of human
DCs, which are specialized in RA-relevant antigen processing and presentation, instead of
transgenic animal models or artificial B cell lines.

DCs are enriched in rheumatoid synovial fluid and tissue and are derived from
circulating immature precursors (Thomas, R. et al., J Immunol 152 (1994) 2613-2623).
They are the most potent antigen-presenting cells and express high levels of MHC
molecules together with a variety of accessory molecules (Mellman, I. et al., Trends Cell
Biol 8 (1998) 231-237). In a recent study, it was shown that ex vivo differentiated
human DCs and macrophages that are phenotypically similar to antigen-presenting cells
from RA synovial joints were capable of generating and presenting immunodominant
epitopes from CII and HCgp39 (Tsark, E.C. et al., J Immunol 169 (2002) 6625-6633). DCs
have the capacity to prime CD4+ helper T cells and to effectively activate cytotoxic CD8+
T cells (Ridge, T. et al., Nature 393 (1998) 474-478). Thus, peptides bound to MHC class II
molecules and presented by DCs play a key role in the pathogenesis of diseases
involving T cell-driven immune responses.

Therefore, the problem posed by the lack of knowledge of MHC class II restricted 
antigenic peptides for RA is solved by providing novel naturally-processed MHC class II 
associated RA antigenic peptides and the polypeptides they are derived from as markers for 
RA. 

The present invention provides novel naturally-processed antigenic peptides which
are candidate RA markers in erosive and non-erosive RA. These antigenic peptides are
presented by human MHC class II HLA-DR molecules derived from dendritic cells which
were pulsed with serum or synovial fluid derived from patients with established erosive or
non-erosive RA. The MHC class II antigenic peptides of the invention comprise (a) at
least the amino acid sequence of the peptide binding motif selected from the group
consisting of SEQ ID NOs. 49 to 57, or (b) at least the amino acid sequence of the peptide
binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 with additional
N- and C-terminal flanking sequences of a corresponding sequence selected from the group
consisting of SEQ ID NOs. 1 to 39, and originate from interferon-γ-inducible lysosomal
thiol reductase, apolipoprotein B-100, inter-α-trypsin inhibitor heavy chain H4,
complement C4, complement C3, SH3 domain-binding glutamic acid-rich-like
protein 3, interleukin-4-induced protein 1, hemopexin, and Hsc70-interacting protein.
The present invention also provides these antigenic peptides and the proteins they are
derived from as markers for erosive and/or non-erosive RA. Moreover, these antigenic
peptides linked to MHC class II molecules, as well as antibodies reactive with said antigenic
peptides, nucleic acids encoding said antigenic peptides, and nucleic acid constructs, host
cells and methods for expressing said antigenic peptides are provided. Further methods are
provided for isolating and identifying RA antigenic peptides.

Fig. 1: Diagram of Dendritic cell (DC)-mediated analysis of tissue samples: 
Dendritic cells (DCs), the most specialized antigen-presenting cells (APCs), are brought in 
contact with an antigen source (e.g. synovial fluid) under optimal conditions for antigen 
uptake and antigen processing. As a control, DCs are cultured under the same conditions 
in the absence of synovial fluid antigens. After maturation of DCs, antigen-loaded MHC
class II molecules are purified and the respective MHC class II-associated antigenic 
peptides are isolated and identified. 

Fig. 2A: ION-TRAP MS Base Peak Chromatogram of MHC class II-associated 
antigenic peptides that were isolated from dendritic cells pulsed with the serum of an RA
patient. The peptides were eluted directly from a RP-C18-HPLC column into the ion trap
mass spectrometer for immediate MS/MS identification. The numbers indicate the 
retention times (upper value) and the molecular masses (lower value) of the most 
prominent peptide peaks in the mixture at the respective time. 

Fig. 2B: ION-TRAP MS spectrum of antigenic peptides at a retention time of 
65.4 min. The marked peak was further fragmented and corresponded to a doubly charged
peptide ion from the inter-alpha-trypsin inhibitor ITIH4 (cf. table 3). 

Fig. 2C: ION-TRAP MS/MS spectrum of the doubly charged peptide ion at m/z 
977.1. The fragmentation masses, together with the mass of the parent ion, were searched 
against a non-redundant human database by using the SEQUEST algorithm. The retrieved 
sequence MPKNWFVIDKSGSMSGR (one-letter-code) corresponded to the dominant
epitope ITIH4 (271-288) of the inter-alpha-trypsin inhibitor. The positions of the assigned 
series of N-terminal B-ions and C-terminal Y-ions are marked. 
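
For orientation, the relation between the measured m/z of a multiply protonated peptide ion and the neutral peptide mass can be sketched as follows. This is only an illustrative conversion under the usual assumption that each charge is carried by one added proton; the helper name neutral_mass is hypothetical, and the sketch does not reproduce the SEQUEST search described for Fig. 2C.

```python
PROTON_MASS = 1.007276  # Da, mass of one proton


def neutral_mass(mz, charge):
    """Neutral mass M of an [M + zH]z+ ion: M = z * (m/z - proton mass)."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return charge * (mz - PROTON_MASS)


# Doubly charged parent ion at m/z 977.1, as named in Fig. 2C.
print(round(neutral_mass(977.1, 2), 1))  # 1952.2
```

The value obtained this way is the parent mass that, together with the fragment masses, would be passed to a database search.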
The antigenic peptides of the invention are peptides which are associated
with and presented by MHC molecules and thereby can have the potential to activate or
tolerize T cells. Antigenic peptides presented by MHC class II molecules are therefore
MHC class II associated or MHC class II antigenic peptides, whereas antigenic peptides
presented by MHC class I molecules are MHC class I associated or MHC class I antigenic
peptides.

Peptides which are derived from proteins that are encoded in the genome of the body 
or an APC are denoted as "self-peptides". The main function of self-peptides presented by 
DCs in the peripheral lymphoid organs is thought to be the induction of T cell tolerance to 
self-proteins. Tolerance is the failure to respond to an antigen; when that antigen is borne 
by self tissues, tolerance is called self tolerance. 

Antigens which are derived from an individual's own body are called "self antigens" 
or "autoantigens". An adaptive immune response directed against self antigens is called an 
autoimmune response. Likewise, adaptive immunity specific for self antigens is called 
autoimmunity. Autoreactivity describes immune responses directed against self antigens. 
RA is probably due to an autoimmune response that is based on the involvement of
autoreactive T cells and/or autoreactive antibodies. The term "immunogenic peptide"
includes, but is not limited to, an antigenic peptide capable of causing or stimulating a
cellular or humoral immune response. Such peptides may also be reactive with antibodies.

Peptides derived from proteins encoded in the genome of bacteria, viruses or other 
foreign invaders and which differ from self-proteins are called "foreign antigenic" or 
"foreign" peptides. They are able to elicit a T cell response against foreign proteins they are 
derived from. 

RA antigenic peptides are self-peptides that function as self antigens and as a 
consequence of the disease erroneously trigger autoreactivity against self tissues. 



The present invention provides an MHC class II antigenic peptide comprising (a) at
least the amino acid sequence of the peptide binding motif selected from the group
consisting of SEQ ID NOs. 49 to 57, or (b) at least the amino acid sequence of the peptide
binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 with additional
N- and C-terminal flanking sequences of a corresponding sequence selected from the group
consisting of SEQ ID NOs. 1 to 39. Preferably, the MHC class II antigenic peptide has a
length of less than 26 amino acids, more preferably a length of 11 to 25 amino acids. Even
more preferred is the antigenic peptide of the invention with a length of 11 to 19 amino
acids. Most preferred is the antigenic peptide of the invention consisting of the
peptide binding motif comprising the four anchor amino acids.

The present invention also provides an MHC class II antigenic peptide comprising (a)
at least the amino acid sequence of the peptide binding motif of SEQ ID NO. 49, or (b) at
least the amino acid sequence of the peptide binding motif of SEQ ID NO. 49 with
additional N- and C-terminal flanking sequences of a corresponding sequence selected
from the group consisting of SEQ ID NOs. 1 to 3.

Furthermore, an MHC class II antigenic peptide is provided comprising (a) at least the
amino acid sequence of the peptide binding motif of SEQ ID NO. 50, or (b) at least the
amino acid sequence of the peptide binding motif of SEQ ID NO. 50 with additional
N- and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 5.

The MHC class II associated novel antigenic peptides of the invention originate from
interferon-γ-inducible lysosomal thiol reductase (SEQ ID NOs. 1 to 3), apolipoprotein
B-100 (SEQ ID NOs. 4 and 5), inter-α-trypsin inhibitor heavy chain H4 (SEQ ID NOs. 6 to
12), complement C4 (SEQ ID NOs. 13 to 18), complement C3 (SEQ ID NOs. 19 to 23),
SH3 domain-binding glutamic acid-rich-like protein 3 (SEQ ID NOs. 24 to 27),
interleukin-4-induced protein 1 (SEQ ID NOs. 28 to 30), hemopexin (SEQ ID NOs. 31 to
35), and Hsc70-interacting protein (SEQ ID NOs. 36 to 39).

The single peptide binding groove of MHC class II molecules is about 25 Å long, but
in contrast to MHC class I molecules, both sides are open (Stern LJ et al., Nature 1994; 368,
215-221). Thus, naturally processed antigenic peptides eluted from human MHC class II
molecules have a minimal length of about 11 residues and attain a maximal length of about
25 residues (Chicz RM et al., J Exp Med 1993; 178, 27-47).

The stability of the MHC-peptide interaction is determined by more than a dozen
hydrogen bonds involving the peptide backbone and by the complementarity between
specificity pockets of the binding groove and appropriately located amino acid side-chains
of the peptide. The amino acids of the peptide fitting into the respective pockets were
named "anchor" residues. With regard to most HLA-DR alleles, these anchors are located
at relative positions P1, P4, P6 and P9. The combinations of amino acids at these four anchor
positions that confer high-stability binding to the respective HLA-DR allelic product
vary from allele to allele. The peptide binding motif is defined herein as the sequence of
nine amino acids comprising the four anchor amino acids. The peptide binding motif of
the MHC class II antigenic peptide of the invention is depicted in SEQ ID NO. 49 for the
peptides derived from interferon-γ-inducible lysosomal thiol reductase (SEQ ID NOs. 1 to
3), in SEQ ID NO. 50 for the peptides derived from apolipoprotein B-100 (SEQ ID NOs. 4
and 5), in SEQ ID NO. 51 for the peptides derived from inter-α-trypsin inhibitor
heavy chain H4 (SEQ ID NOs. 6 to 12), in SEQ ID NO. 52 for the peptides derived
from complement C4 (SEQ ID NOs. 13 to 18), in SEQ ID NO. 53 for the peptides derived
from complement C3 (SEQ ID NOs. 19 to 23), in SEQ ID NO. 54 for the peptides derived
from SH3 domain-binding glutamic acid-rich-like protein 3 (SEQ ID NOs. 24 to 27), in
SEQ ID NO. 55 for the peptides derived from interleukin-4-induced protein 1 (SEQ ID
NOs. 28 to 30), in SEQ ID NO. 56 for the peptides derived from hemopexin (SEQ ID NOs.
31 to 35), and in SEQ ID NO. 57 for the peptides derived from Hsc70-interacting protein
(SEQ ID NOs. 36 to 39). The peptide binding motif may also comprise at least one, at least
two, at least three, at least four or at least five modifications of the amino acid sequence
while still retaining the binding capacity of the non-modified peptide binding motif.
Preferably, the modified peptide binding motif comprises at least three of the four anchor
amino acids of the non-modified peptide binding motif. The amino acid modification may
be a conservative amino acid substitution as described below.
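
The anchor bookkeeping described above can be illustrated with a short sketch. The nonamer strings and all function names below are hypothetical placeholders chosen for illustration; they are not the sequences of SEQ ID NOs. 49 to 57. As a simplification, an anchor is counted as retained only if the residue is identical, so conservative substitutions at an anchor position are treated as a change.

```python
# Relative anchor positions within the nonameric motif (1-based), as stated in the text.
ANCHOR_POSITIONS = (1, 4, 6, 9)


def anchors(motif: str) -> dict:
    """Return the residues found at P1, P4, P6 and P9 of a nine-residue motif."""
    if len(motif) != 9:
        raise ValueError("a peptide binding motif is defined as nine amino acids")
    return {p: motif[p - 1] for p in ANCHOR_POSITIONS}


def conserved_anchor_count(reference: str, modified: str) -> int:
    """Count anchor positions at which the modified motif keeps the reference residue."""
    ref, mod = anchors(reference), anchors(modified)
    return sum(ref[p] == mod[p] for p in ANCHOR_POSITIONS)


def keeps_preferred_anchors(reference: str, modified: str) -> bool:
    """True if at least three of the four anchor residues are unchanged."""
    return conserved_anchor_count(reference, modified) >= 3


# Hypothetical nonamers, for illustration only.
reference = "YKASLTQRV"
modified = "YKASLSQRV"  # P6 changed from T to S; P1, P4 and P9 unchanged
print(anchors(reference))
print(keeps_preferred_anchors(reference, modified))
```

A motif that changes two or more anchors would fail this preference check even if its non-anchor positions were left untouched.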

Additional binding energy is provided by hydrogen bonds involving residues in front
of the P1 anchor and behind the P9 anchor. In agreement with that, in most naturally
processed peptides the nonameric core region (P1-P9) is N- and C-terminally flanked by
3-4 residues. Hence, the majority of peptides are 15-17-mers. Longer peptides protrude
from the groove, thereby allowing access of exopeptidases which trim both ends.

Therefore, the MHC class II antigenic peptide of the invention comprising (a) at
least the amino acid sequence of the peptide binding motif selected from the group
consisting of SEQ ID NOs. 49 to 57, or (b) at least the amino acid sequence of the peptide
binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 with additional
N- and C-terminal flanking sequences of a corresponding sequence selected from the
group consisting of SEQ ID NOs. 1 to 39, preferably comprises additional N- and
C-terminal flanking amino acid residues providing additional binding energy.

Preferably, the MHC class II antigenic peptide of the present invention has a binding
capacity to the corresponding MHC class II molecule of between one tenth and ten-fold
the IC50 of a corresponding peptide selected from the group consisting of SEQ ID NOs. 1
to 39. The binding capacity of a peptide is measured by determining the concentration
necessary to reduce binding of a labelled reporter peptide by 50%. This value is called IC50.
A MHC class II antigenic peptide of the invention maintains its binding capacity to the
relevant HLA class II molecules as long as it attains IC50 values between one tenth and
10-fold the IC50 of the established reference peptides.
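
The acceptance window stated above (one tenth to ten-fold the reference IC50) amounts to a simple ratio test. The following is a minimal sketch; the function name and the numerical values are assumptions made for illustration, and any concentration unit may be used as long as both values share it.

```python
def within_binding_window(ic50: float, reference_ic50: float) -> bool:
    """Check whether an IC50 lies between 0.1x and 10x the reference IC50."""
    if ic50 <= 0 or reference_ic50 <= 0:
        raise ValueError("IC50 values must be positive concentrations")
    ratio = ic50 / reference_ic50
    return 0.1 <= ratio <= 10.0


# Hypothetical values (e.g. in nM), for illustration only.
reference_ic50 = 500.0
for candidate in (200.0, 6000.0, 10.0):
    print(candidate, within_binding_window(candidate, reference_ic50))
```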






Since peptide trimming occurs in an individual fashion both before and after
binding into the peptide binding groove, the occurrence of several truncation
variants sharing a common nonameric core region is a common feature of MHC class II-bound
peptides. Importantly, it was shown that C- or N-terminal truncation variants of
the same epitope can trigger divergent T cell responses (Arnold et al., (2002) J. Immunol.
169, 739-749).
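
A nested set of truncation variants can be recognised by locating the shared nonameric core in each peptide and recording the lengths of the N- and C-terminal flanks. The sketch below assumes the core is already known; the sequences and function names are hypothetical and serve only to illustrate the 9 + 2 x (3 to 4) = 15 to 17 residue arithmetic and shorter variants of the same epitope.

```python
from typing import Dict, List, Optional, Tuple


def flanks(peptide: str, core: str) -> Optional[Tuple[int, int]]:
    """Return (N-terminal, C-terminal) flank lengths around the core, or None if absent."""
    start = peptide.find(core)
    if start < 0:
        return None
    return start, len(peptide) - start - len(core)


def nested_set(peptides: List[str], core: str) -> Dict[str, Tuple[int, int]]:
    """Map each truncation variant that contains the core to its flank lengths."""
    result = {}
    for peptide in peptides:
        found = flanks(peptide, core)
        if found is not None:
            result[peptide] = found
    return result


# Hypothetical sequences, for illustration only.
core = "YKASLTQRV"
variants = [
    "GSEYKASLTQRVDPK",    # 15-mer: 3 + 9 + 3
    "AGSEYKASLTQRVDPKL",  # 17-mer: 4 + 9 + 4
    "SEYKASLTQRVDP",      # 13-mer: 2 + 9 + 2
    "MNQWERTPLHG",        # unrelated peptide without the core
]
for peptide, (n_flank, c_flank) in nested_set(variants, core).items():
    print(len(peptide), n_flank, c_flank, peptide)
```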

Several parameters can be envisaged that have an influence on the relative abundance
of truncation variants of a particular epitope, e.g. the abundance and integrity of the
antigen of relevance, antigen-associated proteins, the abundance of proteases, the type of
proteases available and the supply of competing antigens and/or peptides. Since the
antigen supply is a major characteristic that may correlate with the origin of a sample, the
ratio of particular truncation variants of an epitope can be of diagnostic value.



A peptide of the invention is a peptide which either has no naturally-occurring
counterpart (e.g., a mutated peptide antigen), or has been isolated, i.e., separated
or purified from components which naturally accompany it, e.g., in tissues such as
pancreas, liver, spleen, ovary, testis, muscle, joint tissue, neural tissue, gastrointestinal
tissue, or body fluids such as blood, serum, synovial fluid or urine. Typically, the peptide is
considered "isolated" when a preparation comprising a peptide of the invention consists to
at least 70% by dry weight of said peptide and to less than 30% of the proteins and
naturally-occurring organic molecules with which it is naturally associated. Preferably, a
preparation of a peptide of the invention consists of at least 80%, more preferably at least
90%, and most preferably at least 99%, by dry weight, of the peptide of the invention. Since a
peptide that is chemically synthesized is, by its nature, separated from the components that
naturally accompany it, the synthetic peptide is "isolated".

The invention further provides analogs of the antigenic peptide of the invention. The
term analog includes any peptide which displays the functional aspects of these antigenic
peptides, including the binding capacity (IC50) and the recognition by antibodies and cells
of the immune system. Analogs exhibit essentially the same IC50 as the corresponding
reference peptide. The term analog also includes conservative substitutions or chemical
derivatives of the peptides.

The term "analog" includes any polypeptide having an amino acid residue sequence
substantially identical to the sequences described herein in which one or more residues
have been conservatively substituted with a functionally similar residue and which
displays the functional aspects of the peptides as described herein. Examples of
conservative substitutions include the substitution of one non-polar (hydrophobic)
residue such as phenylalanine, tyrosine, isoleucine, valine, leucine or methionine for
another, the substitution of one polar (hydrophilic) residue for another such as between
arginine and lysine, between glutamine and asparagine, between threonine and serine, the
substitution of one basic residue such as lysine, arginine or histidine for another, or the
substitution of one acidic residue, such as aspartic acid or glutamic acid for another.
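
The substitution classes listed above can be encoded as residue groups, with a substitution treated as conservative when both residues fall within the same group. The sketch below uses one-letter amino acid codes and reproduces only the examples given in the text, which are not exhaustive; the group table and function name are assumptions for illustration.

```python
# Residue groups taken from the examples in the text (one-letter codes).
CONSERVATIVE_GROUPS = [
    frozenset("FYIVLM"),  # non-polar (hydrophobic) residues as listed in the text
    frozenset("RK"),      # polar: arginine / lysine
    frozenset("QN"),      # polar: glutamine / asparagine
    frozenset("TS"),      # polar: threonine / serine
    frozenset("KRH"),     # basic residues
    frozenset("DE"),      # acidic residues
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two residues differ and share at least one listed group."""
    if original == replacement:
        return False  # an unchanged residue is not a substitution
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


for pair in (("L", "V"), ("K", "H"), ("D", "E"), ("L", "D")):
    print(pair, is_conservative(*pair))
```

Residues outside the listed examples (for instance alanine or glycine) are reported as non-conservative by this table, which reflects the limits of the example list rather than a statement about those residues.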

The phrase "conservative substitution" also includes the use of a chemically 
derivatized amino acid in place of a non-derivatized amino acid. "Chemical derivative" 
refers to a subject polypeptide having one or more amino acids chemically derivatized by 
reaction of a functional side group. Examples of such derivatized molecules include
those molecules in which free amino groups have been derivatized to form amine
hydrochlorides, p-toluene sulfonyl groups, carbobenzoxy groups, t-butyloxycarbonyl 
groups, chloroacetyl groups, acetyl groups or formyl groups. Free carboxyl groups may be 
derivatized to form salts, methyl and ethyl esters or other types of esters or hydrazides. Free 
hydroxyl groups may be derivatized to form O-acyl or O-alkyl derivatives. The imidazole 
nitrogen of histidine may be derivatized to form N-im-benzylhistidine. Also included as
chemical derivatives are those proteins or peptides which contain one or more
naturally-occurring amino acid derivatives of the twenty standard amino acids. For example:
4-hydroxyproline may be substituted for proline; 5-hydroxylysine may be substituted for
lysine; 3-methylhistidine may be substituted for histidine; homoserine may be substituted 
for serine; and ornithine or citrulline may be substituted for lysine. 



The MHC class II antigenic peptides of the invention and the proteins they are 
derived from can be used as markers in diagnosis of RA and in therapy as anti-RA 
vaccines. The term marker as used herein refers to a biomolecule, preferably a peptide or a 
polypeptide, which is expressed in a group of patients with a diagnosed disease, e.g. RA, 
and attains an abundance that is significantly increased or decreased as compared to a 
control group. 

The marker of the present invention may be used as a prognostic marker to predict 
the susceptibility to a disease, e.g., to predict the susceptibility to RA, as a diagnostic 
marker for the diagnosis of a disease, e.g. for the diagnosis of RA, as a differential 
diagnostic marker to differentiate between different forms of a disease, e.g., to differentiate 
between different forms of RA, as a prognostic marker for the prediction of the outcome of
a disease, e.g., for the prognosis of RA, and as a response marker to determine the
efficacy of a therapeutic regime, e.g., as a response marker in the treatment of RA.



In a further embodiment, the MHC class II antigenic peptide comprising (a) at least
the amino acid sequence of the peptide binding motif selected from the group consisting of
SEQ ID NOs. 49 to 57, or (b) at least the amino acid sequence of the peptide binding
motif selected from the group consisting of SEQ ID NOs. 49 to 57 with additional N- and
C-terminal flanking sequences of a corresponding sequence selected from the group
consisting of SEQ ID NOs. 1 to 39 is used as a marker for erosive and/or non-erosive RA.

In a further embodiment, the MHC class II antigenic peptide comprising (a) at least
the amino acid sequence of the peptide binding motif of SEQ ID NO. 49, or (b) at least the
amino acid sequence of the peptide binding motif of SEQ ID NO. 49 with additional N-
and C-terminal flanking sequences of a corresponding sequence selected from the group
consisting of SEQ ID NOs. 1 to 3 is used as a marker for non-erosive RA.

In a further embodiment, the MHC class II antigenic peptide comprising (a) at least
the amino acid sequence of the peptide binding motif of SEQ ID NO. 50, or (b) at least the
amino acid sequence of the peptide binding motif of SEQ ID NO. 50 with additional N-
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 5 is used
as a marker for erosive RA.


In a further embodiment, the MHC class II antigenic peptides of the invention as 
described above are provided linked to a MHC class II molecule. 

Multimers (e.g., dimers, trimers, tetramers, pentamers, hexamers or oligomers) of a
class II MHC molecule containing a covalently or non-covalently bound peptide according
to the present invention, if conjugated with a detectable label (e.g., a fluorescent moiety, a
radionuclide, or an enzyme that catalyzes a reaction resulting in a product that absorbs or
emits light of a defined wavelength) can be used to quantify T cells from a subject (e.g., a
human patient) bearing cell surface receptors that are specific for, and therefore will bind,
such complexes. Relatively high numbers of such T cells are likely to be diagnostic of
disease or an indication that the T cells are involved in immunity to the disease. In
addition, continuous monitoring of the relative numbers of multimer-binding T cells can
be useful in establishing the course of a disease or the efficacy of therapy. Such assays have
been developed using tetramers of class I MHC molecules containing an HIV-1-derived or
an influenza virus-derived peptide (Altman et al. (1996), Science 274:94-96; Ogg et al.
(1998), Science 279:2103-2106), and corresponding class II MHC multimers
would be expected to be similarly useful. Such complexes could be produced by
chemical cross-linking of purified class II MHC molecules assembled in the presence of a
peptide of interest or by modification of already established recombinant techniques for
the production of class II MHC molecules containing a single defined peptide (Kozono et
al. (1994), Nature 369:151-154; Gauthier et al. (1998), Proc. Natl. Acad. Sci. U.S.A.
95:11828-11833). The class II MHC molecule monomers of such multimers can be native
molecules composed of full-length alpha and beta chains. Alternatively, they can be
molecules containing either the extracellular domains of the alpha and beta chains or the
alpha and beta chain domains that form the "walls" and "floor" of the peptide-binding cleft.

The invention also relates to an antibody, or fragments or derivatives thereof, directed
to and reactive with the above-described MHC class II antigenic peptides. The general
methodology for producing antibodies is well known and is disclosed, for example, in
Kohler and Milstein, 1975, Nature 256, 494 or in J. G. R. Hurrell, Monoclonal Hybridoma
Antibodies: Techniques and Applications, CRC Press Inc., Boca Raton, FL (1982). The
antibodies can be polyclonal or, preferably, monoclonal, or antibody fragments such as
F(ab')2, Fab, Fv or scFv. The antibodies of the present invention may also be humanized
(Merluzzi S. et al., (2000), Adv. Clin. Path., 4(2): 77-85) or human antibodies (Aujame L.
et al., Hum. Antibodies, (1997), 8(4): 155-168).

The present invention also provides a nucleic acid molecule encoding a MHC class II
antigenic peptide of the invention comprising (a) at least the amino acid sequence of the
peptide binding motif selected from the group consisting of SEQ ID NOs. 49 to 57, or (b)
at least the amino acid sequence of the peptide binding motif selected from the group
consisting of SEQ ID NOs. 49 to 57 with additional N- and C-terminal flanking sequences
of a corresponding sequence selected from the group consisting of SEQ ID NOs. 1 to 39.
Preferably, the nucleic acid molecule is a DNA molecule.

Furthermore, a nucleic acid molecule is provided encoding a MHC class II antigenic 
peptide of the invention linked to a MHC class II molecule. 

This invention also provides a recombinant nucleic acid construct comprising the
nucleic acid molecules as described above, operably linked to an expression vector.
Expression vectors suitable for use in the present invention comprise at least one
expression control element operably linked to the nucleic acid sequence encoding the
antigenic peptide or the antigenic peptide linked to a MHC class II molecule. The
recombinant expression construct may be a DNA construct.






The expression control elements are inserted in the vector to control and
regulate the expression of the nucleic acid sequence encoding the antigenic peptide of
the invention. Examples of expression control elements include, but are not limited to, lac
system, operator and promoter regions of phage lambda, yeast promoters and promoters
derived from polyoma, adenovirus, retrovirus or SV40. Additional preferred or required
operational elements include, but are not limited to, leader sequence, termination codons,
polyadenylation signals and any other sequences necessary or preferred for the appropriate
transcription and subsequent translation of the nucleic acid sequence in the host system. It
will be understood by one skilled in the art that the correct combination of required or
preferred expression control elements will depend on the host system chosen. It will
further be understood that the expression vector should contain additional elements
necessary for the transfer and subsequent replication of the expression vector containing
the nucleic acid sequence in the host system. Examples of such elements include, but are
not limited to, origins of replication and selectable markers. It will further be understood
by one skilled in the art that such vectors are easily constructed using conventional
methods (www.cellbio.com/protocols.html) or are commercially available.

Another aspect of this invention relates to a host organism or a host cell into which a
recombinant nucleic acid construct comprising the nucleic acid molecules as described
above, operably linked to an expression vector, has been inserted. The host cells
transformed with the nucleic acid constructs encompassed by this invention include
eukaryotes, such as animal, plant, insect and yeast cells, and prokaryotes, such as E. coli.
The means by which the nucleic acid construct carrying the nucleic acid sequence may be
introduced into the cell include, but are not limited to, microinjection, electroporation,
transduction, or transfection using DEAE-dextran, lipofection, calcium phosphate or other
procedures known to one skilled in the art (Sambrook et al. (1989) in "Molecular Cloning.
A Laboratory Manual", Cold Spring Harbor Press, Plainview, New York).

In a preferred embodiment, eukaryotic expression vectors that function in
eukaryotic cells are used. Examples of such vectors include, but are not limited to,
retroviral vectors, vaccinia virus vectors, adenovirus vectors, herpes virus vectors, fowl pox
virus vectors, plasmids, or the baculovirus transfer vectors. Preferred eukaryotic cell lines
include, but are not limited to, COS cells, CHO cells, HeLa cells, NIH/3T3 cells, 293 cells
(ATCC# CRL1573), T2 cells, dendritic cells, monocytes or Epstein-Barr virus
transformed B cells.

An antigenic peptide of the invention can be obtained, for example, by extraction
from a natural source (e.g., elution from MHC II molecules); by expression of a
recombinant nucleic acid encoding the peptide; or by chemical synthesis. A peptide that is
produced in a cellular system different from the source from which it naturally
originates is "isolated," because it will be separated from components which naturally
accompany it. The recombinant peptide expressed by a host organism can be obtained as a
crude lysate or can be purified by standard protein purification procedures known in the
art which may include differential precipitation, size exclusion chromatography,
ion-exchange chromatography, isoelectric focusing, gel electrophoresis, affinity and
immunoaffinity chromatography and the like. The extent of isolation or purity can be
measured by any appropriate method, e.g. mass spectrometry or HPLC analysis. The
peptides may be prepared synthetically by procedures described in Merrifield, (1986)
Science 232: 341-347, and Barany and Merrifield, The Peptides, Gross and Meienhofer, eds
(N. Y., Academic Press), pp. 1-284 (1979). The synthesis can be carried out in solution or
in solid phase or with an automated synthesizer (Stewart and Young, Solid Phase Peptide
Synthesis, 2nd ed., Rockford, Ill., Pierce Chemical Co. (1984)).

Therefore, the present invention further provides a method for producing a MHC
class II antigenic peptide comprising (a) at least the amino acid sequence of the peptide
binding motif selected from the group consisting of SEQ ID NOs. 49 to 57, or
(b) at least the amino acid sequence of the peptide binding motif selected from the group
consisting of SEQ ID NOs. 49 to 57 with additional N- and C-terminal flanking sequences
of a corresponding sequence selected from the group consisting of SEQ ID NOs. 1 to 39,
comprising the steps of culturing the host cell containing a recombinant nucleic acid
construct as described above under conditions allowing expression of said peptide and
recovering the peptide from the cells or the culture medium.



In a further embodiment of the present invention, a method is provided for isolating
and identifying MHC class II associated RA antigenic peptides in femtomolar amounts,
which method comprises (a) providing immature dendritic cells in a number comprising
0.1 to 5 µg MHC class II molecules; (b) contacting the cells of (a) with serum or synovial
fluid and inducing maturation of dendritic cells by adding TNFalpha; (c) isolating class II
MHC molecule-antigenic peptide complexes from the cells with methods comprising
solubilization of the cells and sequestration of the complexes of MHC class II molecules
with antigenic peptides by immunoprecipitation or immunoaffinity chromatography; (d)
washing the sequestered complexes of MHC class II molecules with antigenic peptides with
water in an ultrafiltration tube; (e) eluting the associated antigenic peptides from the MHC
class II molecules at 37°C with diluted trifluoroacetic acid; and
(f) separating, detecting and identifying the isolated peptides by liquid chromatography
and mass spectrometry. Furthermore, in step (f) of the method, the liquid chromatography
comprises a first linear elution step from the reversed-phase material with a volume
sufficient to elute the majority of contaminants prior to peptide elution.
Moreover, the method may further comprise (g) analyzing the identified peptides by
methods comprising a database and a software developed to perform comparative data
analysis across multiple datasets.

The amount of tissue or bodily fluid necessary to obtain e.g. 100 ng MHC class II
molecules depends on the number of cells that express MHC class II and on the
expression rate of MHC class II molecules: e.g. 100 ng of MHC class II are equivalent to
about 2 x 10^5 mature DCs or 5 to 10 x 10^6 peripheral blood monocytes or about 5 x 10^7
peripheral blood mononuclear cells which can be obtained from about 50 ml of blood.
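
The equivalences above can be scaled to other target amounts of MHC class II. The sketch below assumes simple linear scaling from the stated figures for 100 ng, which is an approximation made here for illustration; actual yields depend on the expression rate of the cells. The dictionary keys and function names are hypothetical.

```python
# Approximate cell numbers equivalent to 100 ng MHC class II, as stated in the text.
# Each entry is a (low, high) range.
CELLS_PER_100_NG = {
    "mature_dc": (2e5, 2e5),
    "blood_monocyte": (5e6, 1e7),
    "pbmc": (5e7, 5e7),
}
PBMC_PER_50_ML_BLOOD = 5e7  # about 5 x 10^7 PBMC from about 50 ml of blood


def cells_required(mhc_ng: float, cell_type: str) -> tuple:
    """Scale the stated 100 ng equivalents linearly to the requested amount."""
    low, high = CELLS_PER_100_NG[cell_type]
    factor = mhc_ng / 100.0
    return low * factor, high * factor


def blood_volume_ml(mhc_ng: float) -> float:
    """Estimate the blood volume needed when PBMC are the cell source."""
    pbmc_needed = cells_required(mhc_ng, "pbmc")[0]
    return pbmc_needed / PBMC_PER_50_ML_BLOOD * 50.0


# 0.1 and 5 micrograms correspond to 100 ng and 5000 ng.
for amount_ng in (100.0, 5000.0):
    print(amount_ng, cells_required(amount_ng, "mature_dc"), blood_volume_ml(amount_ng))
```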

For the purification of class II MHC molecule-antigenic peptide complexes from 
cells or tissue, the membranes of the cells or tissue have to be solubilized. Cell lysis may be 
carried out with methods known in the art, e.g. freeze-and-thaw cycles and the use of 
detergents, and combinations thereof. Preferred lysis methods are solubilization using 
detergents, preferably TX-100, NP40, n-octylglucoside, Zwittergent, Lubrol, CHAPS, most
preferably TX-100 or Zwittergent 3-12. Cell debris and nuclei have to be removed from cell 
lysates containing the solubilized receptor-peptide complexes by centrifugation. Therefore, 
the complexes of class II MHC molecules with antigenic peptides are isolated from the cells 
with methods comprising solubilization with a detergent. 

Furthermore, the MHC class II molecule-peptide complexes are purified from cell
lysates by methods comprising immunoprecipitation or immunoaffinity chromatography.
For the immunoprecipitation or immunoaffinity chromatography, antibodies specific for
MHC class II molecules and suitable for these methods are used. The specific antibodies
are preferably monoclonal antibodies, and are coupled covalently or non-covalently, e.g. via
Protein A, to beads, e.g. sepharose or agarose beads. A selection of the broad panel of
anti-HLA antibodies used in the prior art comprises:

anti-HLA-DR antibodies: L243, TU36, DA6.147, preferably L243; anti-HLA-DQ 
antibodies: SPVL3, TU22, TU169, preferably TU22 and TU169; anti-HLA-DP antibody 
B7/21 and anti-HLA-A,B,C antibodies W6/32 and B9.12. 

Monoclonal antibodies specific for different MHC class II molecules may be
commercially obtained (e.g. Pharmingen, Dianova) or purified from the supernatant of the
respective hybridoma cells using Protein A- or Protein G-affinity chromatography.
Purified monoclonal antibodies may be coupled by various methods known in the art,
preferably by covalently coupling antibody amino groups to CNBr-activated sepharose.
Immunoisolation of MHC molecules may be performed by incubating the
antibody-beads with the cell lysate under rotation for several hours or
chromatographically by pumping the cell lysate through a micro-column. Washing of the
antibody-beads may be performed in eppendorf tubes or in the microcolumn. The efficacy
of the immunoprecipitation may be analysed by SDS-PAGE and western blotting using
antibodies recognizing denatured MHC molecules (anti-HLA-DRalpha: 1B5; anti-HLA
class I: HC10 or HCA2).

The sequestered MHC class II molecule-peptide complexes are washed with water or
low-salt buffer before elution in order to remove residual detergent contaminants. The
low-salt buffer may be a Tris, phosphate or acetate buffer in a concentration range of 0.5 - 10
mM, preferably in a concentration of 0.5 mM. In a more preferred embodiment, the MHC
class II molecule-peptide complexes are washed with ultrapure water (sequencing grade)
conventionally used for HPLC analysis, preferably with ultrapure (sequencing grade) water
from MERCK. The washing step may be carried out by ultrafiltration. The ultrafiltration
may be carried out in an ultrafiltration tube with a cut-off of 30 kD, 20 kD, 10 kD or 5 kD,
preferably of 30 kD and a tube volume of 0.5 - 1.0 ml ("Ultrafree" tubes; Millipore). The
washing in the ultrafiltration tube may be carried out 4 to 12 times, preferably 6 to 10
times, with a volume of 10 to 20 times the volume of the beads carrying the
receptor-peptide complexes, preferably with a volume of 15 times the beads. The eluted peptides
may be separated from the remaining MHC class II molecules using the same
ultrafiltration tube. The eluted peptides may then be lyophilized.

By eluting the peptides from the MHC class II molecules, a complex mixture of 
naturally processed peptides derived from the source of potential antigen and from 
polypeptides of intra- or extracellular origin, is obtained. Only after elution can the
peptides be separated and subjected to sequence analysis.

The antigenic peptides in the method of the present invention may be eluted by a
variety of methods known in the art, preferably by using diluted acid, e.g., diluted
acetonitrile (Jardetzky TS et al., Nature 1991 353, 326-329), diluted acetic acid and heating
(Rudensky AY et al., Nature 1991, 353, 622-626; Chicz RM et al., Nature 1992, 358,
764-768) or diluted trifluoroacetic acid at 37°C (Kropshofer H et al., J Exp Med 1992, 175,
1799-1803). Most preferably, the peptides are eluted at 37°C with diluted trifluoroacetic
acid.

The isolated antigenic peptides are then separated, detected and identified. By
detecting it is understood that the amino acid sequence of the individual peptides in the
mixture of isolated antigenic peptides is elucidated by methods adequate to detect and
sequence femtomolar amounts of peptides. By identifying it is understood that it is
established from which proteins or polypeptides the antigenic peptides are
derived and which sequence they constitute within these proteins or polypeptides.

In a first step, the complex mixture of eluted peptides may be separated by one of a
variety of possible chromatographic methods, e.g. by reversed-phase, anion exchange or
cation exchange chromatography or a combination thereof. Preferably, the separation is
performed by C18 reversed-phase chromatography or by reversed-phase / cation exchange
two-dimensional HPLC, denoted as MudPIT (Washburn MP et al., Nat. Biotechnol., (2001),
19, 242-247).

The separation is done in a HPLC mode utilizing fused-silica micro-capillary
columns which are either connected to a nano-flow electrospray source of a mass
spectrometer or to a micro-fractionation device which spots the fractions onto a plate for
MALDI analysis.

Liquid chromatography comprises peptide fractionation by the use of a strong ion
exchange material and a hydrophobic reversed-phase material. For the elution of the
peptides from the ion exchange and reversed-phase material different elution programs are
run one after another comprising elutions with salt and with organic solvents, e.g.,
acetonitrile. The elution from the reversed-phase material is conducted in several steps of
linear gradients of different lengths and slopes. A contamination in the sample to be
fractionated may be any contamination whose elution competes with the detection of the
peptide peaks in the mass spectrometer. Therefore, in order to prevent simultaneous
elution, the contaminants have to be eluted with a sufficient solvent volume prior to the
peptide elution step. Depending on the column used for liquid chromatography the
solvent volume sufficient to elute the contaminants prior to the peptide elution step may
be 100 to 200 times the column volume.
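
To give a sense of scale for the 100 to 200 column volumes mentioned above, the wash volume can be estimated from the column geometry. The sketch below treats the capillary as an empty cylinder and ignores the volume occupied by the packing material, which is a simplifying assumption; the column dimensions are hypothetical and not taken from the text.

```python
import math


def column_volume_ul(inner_diameter_um: float, length_cm: float) -> float:
    """Geometric volume of a cylindrical capillary in microlitres."""
    radius_cm = inner_diameter_um * 1e-4 / 2.0  # 1 micrometre = 1e-4 cm
    volume_cm3 = math.pi * radius_cm ** 2 * length_cm
    return volume_cm3 * 1000.0                  # 1 cm^3 = 1000 microlitres


def contaminant_wash_range_ul(column_ul: float) -> tuple:
    """Solvent volume corresponding to 100 to 200 column volumes."""
    return 100.0 * column_ul, 200.0 * column_ul


# Hypothetical capillary: 75 micrometre inner diameter, 10 cm long.
volume = column_volume_ul(75.0, 10.0)
low, high = contaminant_wash_range_ul(volume)
print(f"column volume ~{volume:.3f} ul, wash volume ~{low:.0f} to {high:.0f} ul")
```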

A variety of mass spectrometric techniques are suitable, preferably MALDI-post
source decay (PSD) MS or electrospray ionization tandem mass spectrometry (ESI-MS),
most preferably ion-trap ESI-MS.

The sequences of the individual peptides can be determined by means known in the
art. Preferably, sequence analysis is performed by fragmentation of the peptides and
computer-assisted interpretation of the fragment spectra using algorithms, e.g. MASCOT
or SEQUEST. Both computer algorithms use protein and nucleotide sequence databases to
perform cross-correlation analyses of experimental and theoretically generated tandem
mass spectra. This allows automated high-throughput sequence analysis.

The isolated and identified antigenic peptides of the invention can be validated
by the MHC binding motif, the MHC binding capacity and/or by T cell
recognition.

MHC binding motif 

Peptides associated with a particular MHC molecule (allelic variant) have common
structural characteristics, denoted as binding motifs, necessary to form stable complexes
with MHC molecules. Peptide ligands eluted from MHC class I molecules are relatively
short, ranging from 8-11 amino acids. Moreover, 2 or 3 side chains of the peptide are
relevant for binding. The position of the respective amino acid side chains varies with the
HLA allele; most often two of these so-called "anchor" residues are located at positions 2
and 9. With respect to a particular anchor position, only 1 or 2 amino acids normally can
function as anchor amino acids, e.g. leucine or valine at position 2 in the case of HLA-A2.

In the case of MHC class II molecules, the peptide length varies from 11 to 25 amino
acids, as longer peptides can bind since both ends of the peptide binding groove are open.
Most HLA class II molecules accommodate up to 4 anchor residues at relative positions P1,
P4, P6 and P9 contained in a nonameric core region. This core region, however, can have
variable distance from the N-terminus of the peptide. In the majority of cases, 2-4
N-terminal residues precede the core region. Hence, the P1 anchor residue is located at
positions 3, 4 or 5 in most HLA class II associated peptides. Peptides eluted from HLA-DR
class II molecules share a large hydrophobic P1 anchor, represented by tyrosine,
phenylalanine, tryptophan, methionine, leucine, isoleucine or valine.
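
The positional rule above suggests a simple first-pass filter for candidate core regions: look for a large hydrophobic residue at position 3, 4 or 5 and take the nine residues starting there. The sketch below is only such a filter, not a motif-validation algorithm, and it does not check the P4, P6 or P9 anchors; the peptide sequences and the function name are hypothetical.

```python
# Large hydrophobic P1 anchor residues for HLA-DR, as listed in the text.
P1_ANCHORS = frozenset("YFWMLIV")


def candidate_cores(peptide: str) -> list:
    """Return (P1 position, nonamer) pairs with a hydrophobic residue at position 3, 4 or 5."""
    found = []
    for position in (3, 4, 5):          # 1-based position of the putative P1 anchor
        start = position - 1
        core = peptide[start:start + 9]
        if len(core) == 9 and core[0] in P1_ANCHORS:
            found.append((position, core))
    return found


# Hypothetical peptides, for illustration only.
print(candidate_cores("GSEYKASLTQRVDPK"))
print(candidate_cores("GSLLKASLTQRVDPK"))
```

Peptides that yield more than one candidate would need the remaining anchor positions, or a binding assay, to decide between the alternatives.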

The position and the exact type of anchor residues constitute the peptide binding
motif which is known for most of the frequently occurring HLA class II allelic products. A
computer algorithm allowing motif validation in peptide sequences is "Tepitope", available
from Vaccinome (www.vaccinome.com).

MHC binding capacity 

Peptides identified by the method of the invention may be tested for their ability to 
bind to the appropriate MHC class II molecule by methods known in the art using, for 
example, isolated MHC class II molecules and synthetic peptides with amino acid 
30 sequences identical to those identified by the method of the invention (Kropshofer H et al., 
J. Exp. Med. 1992; 175, 1799-1803; Vogt AB et al., J. Immunol. 1994; 153, 1665-1673; Sloan 
VS et al., Nature 1995; 375, 802-806). Alternatively, a cellular binding assay using MHC 
class II expressing cell lines and biotinylated peptides can be used to verify the identified 
epitope (Arndt SO et al., EMBO J., 2000; 19, 1241-1251). In both assays, the relative
binding capacity of a peptide is measured by determining the concentration necessary to
reduce binding of a labelled reporter peptide by 50%. This value is called IC50. Peptides
binding with a reasonable affinity to the relevant HLA class II molecules attain IC50
values not exceeding 10-fold the IC50 of established reference peptides.
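
The affinity criterion stated above can be written as a one-line comparison. The Python
sketch below is only an illustration; the function name is hypothetical, and both IC50
values are assumed to be given in the same concentration unit.

```python
def binds_with_reasonable_affinity(ic50_peptide, ic50_reference, max_fold=10.0):
    """Apply the criterion from the text: IC50 <= max_fold x reference IC50.

    Both values must use the same concentration unit (e.g. micromolar).
    """
    if ic50_peptide <= 0 or ic50_reference <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_peptide <= max_fold * ic50_reference
```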

The same binding assays can also be used to test the ability of peptides to bind to 
alternative class II MHC molecules, i.e., class II MHC molecules other than those from 
which they were eluted using the method of the invention. The diagnostic methods of the 
invention using such peptides and therapeutic methods of the invention, using either the 
peptides or peptides derived from them, can be applied to subjects expressing such
alternative class II MHC molecules. 

T cell recognition 

The epitope verification procedure may involve testing of peptides identified by the
method of the invention for their ability to activate CD4+ T cell populations. Peptides with
amino acid sequences either identical to those identified in the present invention or
corresponding to a core sequence derived from a nested group of peptides identified in the
present invention are synthesized. The synthetic peptides are then tested for their ability to
activate CD4+ T cells from (a) test subjects expressing the MHC class II molecule of
interest and having at least one symptom of the disease; and (b) control subjects expressing
the MHC class II molecule of interest and having no symptoms of the disease. Additional
control subjects can be those with symptoms of the disease and not expressing the MHC
class II molecule of interest.

In some diseases (e.g., those with an autoimmune component) responsiveness in the
CD4+ T cells of test subjects but not in CD4+ T cells of the control subjects described in
(b) provides confirmatory evidence that the relevant peptide is an epitope that activates
CD4+ T cells that can initiate, promote, or exacerbate the relevant disease. In other
diseases (e.g., cancer or infectious diseases without an autoimmune component), a similar
pattern of responsiveness and non-responsiveness to that described in the previous
sentence would indicate that the relevant peptide is an epitope that activates CD4+ T cells
that can mediate immunity to the disease or, at least, a decrease in the symptoms of the
disease.

CD4+ T cell responses can be measured by a variety of in vitro methods known in the
art. For example, whole peripheral blood mononuclear cells (PBMC) can be cultured with
and without a candidate synthetic peptide and their proliferative responses measured by,
e.g., incorporation of [3H]-thymidine into their DNA. That the proliferating T cells are
CD4+ T cells can be tested by either eliminating CD4+ T cells from the PBMC
prior to assay or by adding inhibitory antibodies that bind to the CD4 molecule
on the T cells, thereby inhibiting proliferation of the latter. In both cases, the proliferative
response will be inhibited only if CD4+ T cells are the proliferating cells. Alternatively,
CD4+ T cells can be purified from PBMC and tested for proliferative responses to the
peptides in the presence of APC expressing the appropriate MHC class II molecule. Such
APC can be B-lymphocytes, monocytes, macrophages, or dendritic cells, or whole PBMC.
APC can also be immortalized cell lines derived from B-lymphocytes, monocytes,
macrophages, or dendritic cells. The APC can endogenously express the MHC class II
molecule of interest or they can express transfected polynucleotides encoding such
molecules. In all cases the APC can, prior to the assay, be rendered non-proliferative by
treatment with, e.g., ionizing radiation or mitomycin-C.

As an alternative to measuring cell proliferation, cytokine production by the CD4+ T
cells can be measured by procedures known to those skilled in the art. Cytokines include,
without limitation, interleukin-2 (IL-2), interferon-gamma (IFN-gamma), interleukin-4
(IL-4), TNF-alpha, interleukin-6 (IL-6), interleukin-10 (IL-10), interleukin-12 (IL-12) or
TGF-beta. Assays to measure them include, without limitation, ELISA, and bio-assays in which
cells responsive to the relevant cytokine are tested for responsiveness (e.g., proliferation) in
the presence of a test sample.

Alternatively, cytokine production by CD4+ lymphocytes can be directly visualized
by intracellular immunofluorescence staining and flow cytometry.



Moreover, the MHC class II antigenic peptides of the present invention may be used 
in the diagnosis of RA. Therefore, a further embodiment of the invention is the use of an 
antigenic peptide according to the present invention as a marker for RA.

Preferably, a MHC class II antigenic peptide comprising (a) at least the amino acid
sequence of the peptide binding motif selected from the group consisting of SEQ ID NOs.
49 to 57, or (b) at least the amino acid sequence of the peptide binding motif selected from
the group consisting of SEQ ID NOs. 49 to 57 with additional N- and C-terminal flanking
sequences of a corresponding sequence selected from the group consisting of SEQ ID NOs.
1 to 39 is used as a marker for RA.

In another embodiment, the antigenic peptides of the invention may be used as
response markers to track the efficacy of a therapeutic regime. Essentially, a baseline value
for an antigenic peptide can be determined, then a given therapeutic agent is administered,
and the levels of the antigenic peptide are monitored subsequently, wherein a change
in the level of the antigenic peptide is indicative of the efficacy of a therapeutic
treatment.

Furthermore, the antigenic peptides which are only found in certain stages or phases 
of a disease, preferably of RA, may be utilized as stage-specific markers. Essentially, the
levels of the antigenic peptides which have been linked to a certain disease stage are 
monitored regularly, thereby providing information about the stage of the disease and its 
progression. 

The invention also includes the use of the polypeptides the RA antigenic peptides are
derived from as markers for the diagnosis and monitoring of a disease, preferably of RA,
and in particular, of erosive versus non-erosive RA. The rationale for the use of the
respective proteins is that DCs reside in most tissues where they capture exogenous
antigens via specific receptors and via specialized endocytotic mechanisms (e.g.
macropinocytosis) followed by presentation of the processed antigens as peptides on MHC
class II molecules. Previous studies have shown that the frequency of a peptide epitope
found in the context of MHC class II molecules, e.g. the RA antigenic peptides, in the
majority of cases mirrors the abundance of the protein from which this particular peptide
was derived. Therefore, not only the RA antigenic peptides but also the corresponding
proteins can serve as markers for RA.

Therefore, in a further embodiment of the present invention, the use of a
polypeptide selected from the group consisting of interferon-gamma-inducible lysosomal
thiol reductase (SEQ ID NO: 40), apolipoprotein B-100 (SEQ ID NO: 41),
inter-alpha-trypsin inhibitor heavy chain H4 (SEQ ID NO: 42), complement C4 (SEQ ID NO: 43),
complement C3 (SEQ ID NO: 44), SH3 domain-binding glutamic acid-rich-like protein 3
(SEQ ID NO: 45), interleukin-4-induced protein 1 (SEQ ID NO: 46), hemopexin (SEQ ID
NO: 47) and Hsc70-interacting protein (SEQ ID NO: 48) as a marker for RA is provided.
Preferably, the polypeptide is used as a marker for erosive RA. It is also preferred to use the
polypeptide as a marker for non-erosive RA. Especially preferred is the use of
interleukin-4-induced protein 1 (SEQ ID NO: 46) as a marker for RA. The Fig1 polypeptide has
not been known as a marker for RA until now, and is considered an important candidate marker
for RA.

The diagnosis of RA can be made by examining expression and/or composition of a
polypeptide or peptide marker for RA, by a variety of methods, including enzyme linked
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and
immunofluorescence. A test sample from an individual is assessed for the presence of an
alteration in the expression and/or an alteration in composition of a polypeptide or a
peptide of the present invention. An alteration in expression of a polypeptide or
peptide can be, for example, an alteration in the quantitative polypeptide expression (i.e.,
the amount of polypeptide produced); an alteration in the composition of a polypeptide is
an alteration in the qualitative polypeptide expression (e.g., expression of a mutant
polypeptide or of a different splicing variant).

Both such alterations (quantitative and qualitative) can also be present. An
"alteration" in the polypeptide expression or composition, as used herein, refers to an
alteration in expression or composition in a test sample, as compared with the expression
or composition of the peptide or polypeptide in a control sample. A control sample is a
sample that corresponds to the test sample (e.g., is from the same type of cells), and is from
an individual who is not affected by RA. An alteration in the expression or composition of
the peptide or polypeptide in the test sample, as compared with the control sample, is
indicative of RA or a susceptibility to RA. Various means of examining expression or
composition of a peptide or polypeptide of the present invention can be used, including
spectroscopy, colorimetry, electrophoresis, isoelectric focusing, and immunoassays (e.g.,
David et al., U.S. Pat. No. 4,376,110) such as immunoblotting (see also Current Protocols
in Molecular Biology, particularly chapter 10). For example, in one embodiment, an
antibody capable of binding to the polypeptide (e.g., as described above), preferably an
antibody with a detectable label, can be used. Antibodies can be polyclonal, or more
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2)
can be used. The term "labeled", with regard to the probe or antibody, is intended to
encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a
detectable substance to the probe or antibody, as well as indirect labeling of the probe or
antibody by reactivity with another reagent that is directly labeled. Examples of indirect
labeling include detection of a primary antibody using a fluorescently labeled secondary
antibody and end-labeling of a DNA probe with biotin such that it can be detected with
fluorescently labeled streptavidin.

Western blotting analysis, using an antibody as described above that specifically
binds to a peptide or polypeptide of the present invention, may be used to measure the
level or amount of a peptide or polypeptide in a test sample and to compare it with the level
or amount of the peptide or polypeptide in a control sample. Preferably the peptide or
polypeptide in a test sample is measured in a homogeneous or a heterogeneous
immunoassay. A level or amount of the polypeptide in the test sample that is higher or lower
than the level or amount of the polypeptide in the control sample, such that the difference is
statistically significant, is indicative of an alteration in the expression of the polypeptide,
and is diagnostic for RA or a susceptibility to RA.

Therefore, the present invention also relates to a diagnostic composition
comprising an antibody reactive with a MHC class II antigenic peptide of the
invention.

In a further embodiment the antigenic peptides of the invention or the proteins they 
are derived from may be used in the prevention and treatment of a disease, preferably of 
RA. 

One aspect of the invention is a therapeutic purpose, wherein one or more of the 
identified antigenic peptides are used to vaccinate patients against RA, preferably against 
erosive and/or non-erosive RA. In the course of the vaccination the antigenic peptide 
would induce an antigen-specific T cell tolerance in the patient which would ultimately 
lead to regression of the disease or to an attenuation of disease development. 

A promising strategy to induce specific immune tolerance in future clinical trials is 
the use of DNA tolerizing vaccines. DNA tolerizing vaccines encoding autoantigens alone 
were shown to reduce T cell proliferative responses (Ruiz, P. et al., J Immunol 162 (1999)
3336-3341), while DNA tolerizing vaccines co-delivering autoantigen plus IL-4 also
induced protective TH2 responses (Garren, H. et al., Immunity 15 (2001) 15-22). Examples
of non-polynucleotide-specific tolerizing therapies under development include protein
antigens, naturally processed peptides, altered peptide ligands, other biomolecules, such as
DNA, or proteins and peptides containing posttranslational modifications, and antigens
delivered orally to induce "oral tolerance" (reviewed in: Robinson, W.H. et al., Clin
Immunol 103 (2002) 7-12). A potential adverse effect with regard to tolerizing therapies is
the development of autoimmunity. 

To this end, the relevant RA antigenic peptides may be directly administered to the 
patient in an amount sufficient for the peptides to bind to the MHC molecules, and 
provoke peripheral tolerance of T cells. 

Alternatively, the antigenic peptides of the invention may be utilized for the 
generation of vaccines based on DCs. In this case, autologous DCs derived from patients' 
monocytes may be pulsed with the relevant peptides or recombinant proteins containing 
the relevant peptide sequences. 

Therefore, the present invention provides a pharmaceutical composition comprising
a MHC class II antigenic peptide comprising (a) at least the amino acid sequence of the
peptide binding motif selected from the group consisting of SEQ ID NOs. 49 to 57, or
(b) at least the amino acid sequence of the peptide binding motif selected from the group
consisting of SEQ ID NOs. 49 to 57 with additional N- and C-terminal flanking
sequences of a corresponding sequence selected from the group consisting of SEQ
ID NOs. 1 to 39, an antibody reactive with said antigenic peptide, or a polypeptide selected
from the group consisting of SEQ ID NOs. 40 to 48, and optionally a pharmaceutically
acceptable excipient, diluent or carrier. The antigenic peptide has to be present in an
amount sufficient to tolerize specific lymphocytes. Such an amount will depend on the
peptide used, the administration, the severity of the disease to be treated and the general
conditions of the patient and will usually range from 1 to 50 mg/ml, for example in case of
peptides being loaded on dendritic cells.

An acceptable excipient, diluent or carrier may be phosphate buffered saline for in
vitro studies and physiological salt solutions for in vivo applications.

"Vaccination" herein means both active immunization, i.e. the in vivo
administration of the peptides to elicit an in vivo immune tolerance directly in the patient,
and passive immunization, i.e. the use of the peptides to tolerize in vitro CD4+
T lymphocytes or to stimulate autologous or allogeneic dendritic cells, which are
subsequently re-inoculated into the patient.

The present invention also provides the antigenic peptides, antibodies, nucleic acids,
host cells, methods, compositions and uses substantially as herein before described 
especially with reference to the Examples. 

Having now generally described this invention, the same will become better
understood by reference to the specific examples, which are included herein for purpose of
illustration only and are not intended to be limiting unless otherwise specified, in
connection with the following figures.

Examples

The examples below are illustrated in connection with the figures described above
and based on the methodology summarized in Fig. 1, as described in the following.
Commercially available reagents referred to in the examples were used according to
manufacturer's instructions unless otherwise indicated.



Methodology of the invention 
Dendritic cells and culturing 

The study was performed with human dendritic cells which were differentiated from
monocytes, as described below. Monocytes were purified from human peripheral blood.
The blood was taken from healthy donors with the following haplotypes:
(1) HLA-DRB1*0401, *0301, (2) HLA-DRB1*0401, *0304, (3) HLA-DRB1*0401, *1301,
(4) HLA-DRB1*0401, *0701.

All cells were cultured in RPMI 1640 medium (short: RPMI) supplemented with 1 mM
Pyruvate, 2 mM Glutamine and 10% heat-inactivated fetal calf serum (Gibco BRL,
Rockville, MD).

Isolation of peripheral blood mononuclear cells (PBMCs) 

Peripheral blood was obtained from the blood bank in Mannheim, Germany as
standard buffy coat preparations from healthy donors. Heparin (200 I.U./ml blood,
Liquemine, Roche) was used to prevent clotting. Peripheral blood mononuclear cells
(PBMCs) were isolated by centrifugation in LSM® (1.077-1.080 g/ml; ICN, Aurora, OH) at
800g (room temperature) for 30 min. PBMCs were collected from the interphase and
washed twice in RPMI containing 20 mM Hepes (500g for 15 min, 300g for 5 min). In
order to remove erythrocytes, PBMCs were treated with ALT buffer (140 mM ammonium
chloride, 20 mM Tris, pH 7.2) for 3 min at 37°C. PBMCs were washed twice with RPMI
containing 20 mM Hepes (200g for 5 min).

Generation of dendritic cells from peripheral blood monocytes. 

Monocytes were isolated from PBMCs by positive sorting using anti-CD14 magnetic
beads (Miltenyi Biotech, Auburn, CA) according to the manufacturer's protocol.
Monocytes were cultured in RPMI supplemented with 1% non-essential amino acids
(Gibco, BRL, Rockville, MD), 50 ng/ml recombinant human granulocyte macrophage-colony
stimulating factor (GM-CSF; S.A. 1.1 x 10^7 U/mg) (Leucomax; Novartis, Basel,
Switzerland) and 3 ng/ml recombinant human IL-4 (S.A. 2.9 x 10^4 U/µg) (R&D
Systems, Minneapolis, MN). Monocytes were seeded at 0.3 x 10^6/ml in 6-well plates
(Costar) for 5 days to obtain immature dendritic cells.

The quality of monocyte-derived immature dendritic cells was routinely monitored
by flow-cytometric analysis and assessed to be appropriate when they displayed the
following phenotype: CD1a (high), CD3 (neg.), CD14 (low), CD19 (neg.), CD56 (neg.),
CD80 (low), CD83 (neg.), CD86 (low) and HLA-DR (high). In contrast, mature dendritic
cells (cf. below) display the following phenotype: CD1a (low), CD80 (high), CD83 (high),
CD86 (high) and HLA-DR (high). Monoclonal antibodies against CD1a, CD3, CD14,
CD19, CD56, CD80, CD83, CD86 as well as the respective isotype controls were purchased
from Pharmingen (San Diego, CA).

Exposure of dendritic cells to serum or synovial fluid 

Serum and synovial fluid were irradiated for 30 min with 137Cs (70 TBq). To feed
dendritic cells with serum- or synovia-derived antigen, 6 x 10^6 immature dendritic cells
were pulsed with either 1 ml serum or 0.6 ml synovial fluid. At the same time maturation
of dendritic cells was induced by adding 10 ng/ml recombinant human tumor necrosis
factor alpha (TNFα; S.A. 1.1 x 10^5 U/µg). As a control, 6 x 10^6 immature dendritic cells
were incubated with TNFα alone.

After 24 hrs in culture, mature dendritic cells were harvested by centrifugation at 
300g for 10 min. Cells were washed with PBS and transferred to an eppendorf tube. After
centrifugation at 400g for 3 min, the supernatant was completely removed and the cells 
were frozen at -70°C. 

Generation of anti-HLA class II beads 

The anti-HLA-DR monoclonal antibody (mAb) L243 (ATCC, Manassas, VA) was
produced by culturing the respective mouse hybridoma cell line. mAb L243 was purified
using ProteinA sepharose (Pharmacia, Uppsala, Sweden) and immobilized to
CNBr-activated sepharose beads (Pharmacia) at a final concentration of 2.5 mg/ml, according
to the manufacturer's protocol. L243 beads were stored in PBS containing 0.1% Zwittergent
3-12 (Calbiochem, La Jolla, CA).

Nano-scale purification of HLA-DR-peptide complexes

Pellets of frozen dendritic cells were resuspended in 10-fold volume of ice cold lysis
buffer (1% Triton-X-100, 20 mM Tris, pH 7.8, 5 mM MgCl2, containing protease
inhibitors chymostatin, pepstatin, PMSF and leupeptin (Roche, Mannheim, Germany))
and lysed in a horizontal shaker at 1000 rpm, 4°C for 1 h. The cell lysate was cleared from
cell debris and nuclei by centrifugation at 10000g, 4°C for 10 min. The lysate was
co-incubated with L243 beads (5-10 µl L243 beads per 100 µl cell lysate) in a horizontal shaker
at 1000 rpm, 4°C for 2 hrs. Immunoprecipitated HLA-DR-peptide complexes bound to
L243 beads were sedimented by centrifugation at 1000g, 4°C for 1 min and washed four
times with 500 µl 0.1% Zwittergent 3-12 (Calbiochem) in PBS.

The efficacy of depletion of HLA-DR-peptide complexes was monitored by analyzing 
the respective cell lysates before and after immunoprecipitation and aliquots of the beads 
by western blotting using the anti-HLA-DRα-specific mAb 1B5 (Adams, T.E. et al.,
Immunology 50 (1983) 613-624). 

Elution of HLA-DR-associated peptides 

HLA-DR-peptide complexes bound to L243 beads were resuspended in 100 µl H2O
(HPLC-grade; Merck, Darmstadt, Germany), transferred to an ultrafiltration tube,
Ultrafree MC, 30 kD cut-off (Millipore, Bedford, MA) and washed 10 times with 100 µl
H2O (HPLC-grade) by centrifugation for 1-2 min at 10000g at RT. For eluting the bound
peptides, 60 µl 0.1% trifluoroacetic acid (Fluka, Buchs, Switzerland) in H2O (HPLC-grade)
was added and incubation was performed for 30 min at 37°C. Eluted peptides were
collected in a new eppendorf tube by centrifugation of the Ultrafree unit at 10000g for 3
min at RT and immediately lyophilized in a Speed-Vac™ vacuum centrifuge.

Fractionation by two-dimensional nanoflow LC 

To perform high-throughput sequencing of complex peptide mixtures, the MudPIT 
(multidimensional protein identification technology) was used (Washburn, M.P. et al., Nat 
Biotechnol 19 (2001), 242-247) which is based on liquid chromatographic fractionation 
followed by mass spectrometric sequence determination. 

To this end, lyophilized peptides eluted from HLA molecules were resuspended in a 
buffer containing 5% (v/v) acetonitrile (ACN), 0.5% (v/v) acetic acid, 0.012% (v/v) 
heptafluoro butyric acid (HFBA) and 1% (v/v) formic acid. The peptide mixture was 
fractionated on a fused-silica microcapillary column (100 µm i.d. x 375 µm) generated by
a Model P-2000 laser puller (Sutter Instrument Co., Novato, CA). The microcolumn was
packed with 3 µm C18 reversed-phase material (C18-ACE 3 µm [ProntoSIL 120-3-C18
ACE-EPS, Leonberg, Germany]) followed by 3 cm of 5 µm cation exchange material
(Partisphere SCX; Whatman, Clifton, USA).

A fully automated 8-step gradient separation on a LC Packings UltiMate HPLC (LC
Packings, San Francisco, USA) was carried out, using the following buffers: 5% ACN /
0.012% HFBA / 0.5% acetic acid (buffer A), 80% ACN / 0.012% HFBA / 0.5% acetic acid
(buffer B), 250 mM ammonium acetate / 5% ACN / 0.012% HFBA / 0.5% acetic acid
(buffer C), and 1.5 M ammonium acetate / 5% ACN / 0.012% HFBA / 0.5% acetic acid
(buffer D). The first 116 min step consisted of a 75 min gradient from 0 to 40% buffer B
followed by a 10 min gradient from 40 to 80% buffer B, a 6 min hold at 80% buffer B and a
10 min equilibration step with 100% buffer A. The next 5 steps (146 min each) were
characterized by the following profile: 5 min 100% buffer A, 5 min gradient from 0 to x%
buffer C, 5 min 100% buffer A, 30 min gradient from 0 to 10% buffer B, 55 min gradient
from 10 to 35% buffer B, 20 min gradient from 35 to 50% buffer B, 10 min gradient from
50 to 80% buffer B, a 6 min hold at 80% buffer B, and a 10 min equilibration step with
100% buffer A. The buffer C percentages (x) in steps 2-6 were as follows: 20, 40, 60, 80,
and 90%. The 30 min gradient from 0 to 10% buffer B, which is the first linear elution step
from the reversed-phase material, was needed in order to sufficiently separate peptide
elution from the elution of a major contaminant (m/z=945) which otherwise would have
led to the loss of the more hydrophilic peptide peaks. Step 7 consisted of the following
profile: 5 min 100% buffer A, 20 min 100% buffer C, 5 min gradient from 0 to 10% buffer
B, 35 min gradient from 10 to 35% buffer B, 50 min gradient from 35 to 50% buffer B,
10 min gradient from 50 to 80% buffer B, a 5 min hold at 80% buffer B and a 10 min
equilibration step with 100% buffer A. Step 8 was identical to step 7 with the exception of
using buffer D instead of buffer C.

Ion trap MS/MS mass spectrometry

The HPLC column was directly coupled to a Finnigan LCQ Deca XP Plus ion trap 
mass spectrometer (Thermo Finnigan, San Jose, USA) equipped with a nano-LC 
electrospray ionization source. Mass spectrometry in the MS/MS mode was performed 
according to the manufacturer's protocol. Peptides were identified by the SEQUEST 
algorithm (U.S. patents 6,017,693 and 5,538,897).

MALDI-TOF mass spectrometry

Peptides spotted onto an AnchorChip plate were co-crystallized with matrix
(5 mg/ml; α-cyano-4-hydroxy-cinnamic acid (Merck, Darmstadt, Germany), 50%
acetonitrile, 0.1% trifluoroacetic acid). For qualitative analysis of the whole peptide 
repertoire, samples were analyzed on an Ultraflex™ MALDI-TOF mass spectrometer 
(Bruker, Bremen, Germany), according to the manufacturer's protocol. 

Sequence identification by SEQUEST and differential dataset analysis. 

MS/MS fragmentation data were analyzed with the software SEQUEST (Thermo 
Finnigan, San Jose, USA). From an in-house protein database, which was created based on 
the public databases Swiss-Prot and TrEMBL, SEQUEST extracted for each spectrum all 
peptide sequences that corresponded to the molecular mass of the parent ion and 
measured the degree of similarity between the experimental spectrum and the theoretical, 
in silico generated, spectrum. Only the top-scoring candidate sequence was listed. 

The peptide sequences derived from the SEQUEST analysis and their accompanying 
information on mass accuracy, scoring parameters and peptide origin were stored in an 
appropriately designed relational database and further processed. Certain constraints were 
enforced in order to guarantee the storage of only significant sequences with satisfying 
SEQUEST scores. The two most important constraints were: (i) keep only those sequences 
that have a cross correlation coefficient (CC) higher than a certain value and (ii) from the 
remaining sequences keep those ones which have a predefined delta cross correlation 
coefficient (ΔCC). For both criteria the minimum chosen values are based on empirical
knowledge of interpreting SEQUEST results. 
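
The two constraints can be pictured as a two-stage filter. The Python sketch below is
an illustration only and not the software used in the study: the record type, field names
and function name are hypothetical, and the default thresholds are taken from the values
reported in Example 1 (CC > 3.0 and delta CC > 0.15).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One top-scoring SEQUEST candidate (hypothetical record layout)."""
    sequence: str
    cc: float        # cross correlation coefficient
    delta_cc: float  # delta cross correlation coefficient


def filter_hits(hits, min_cc=3.0, min_delta_cc=0.15):
    """Keep hits passing constraint (i) on CC, then constraint (ii) on delta CC."""
    passed_cc = [h for h in hits if h.cc > min_cc]             # constraint (i)
    return [h for h in passed_cc if h.delta_cc > min_delta_cc]  # constraint (ii)
```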

A dataset was defined as the sum of data from a particular set of spectra. The design 
of database and software allowed queries on a single dataset as well as comparisons of 
multiple datasets. Such a database and software design enables comparative sample 
analysis, which is not provided by SEQUEST. For instance, possible queries on a single 
dataset could provide information on the score distribution among the stored spectra, on 
the existence of further sequence length variants or common subsequences, or on the 
protein origin of peptide sequences. Since the occurrence of truncation variants of the 
same epitope is a general characteristic of class II MHC-bound peptides, the existence of 
length variants in a dataset provides additional strong evidence for the presence of an 
epitope in a set of spectra. 

The most important feature in the analysis of multiple datasets is the possibility to
extract a common subset of sequences that satisfies a given criterion. Such a criterion could
be based on sequence similarity, e.g., within all sequences of a collection of datasets,
those sequences were selected that had at least one subsequence in common with any
other sequence. Such comparisons across different datasets constitute the differential
approach (RA samples versus control samples) and thereby optimize the search for
candidate RA marker peptides.

The pairwise similarity scores between sequences were calculated by a software 
routine, which is an implementation of a standard string-comparison algorithm. 
Subsequently, these scores were used to group closely-related sequences (sequences sharing 
a common subsequence) in well-separated clusters by an additionally developed software 
routine, which is based on a well-established algorithm (hierarchical clustering, UPGMA).

The generated clusters (e.g. of peptide truncation variants) were then used to identify 
closely-related sequences across different datasets. 
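
The grouping step can be sketched as follows. This is a minimal Python illustration under
stated assumptions, not the routine used in the study: the similarity score (longest common
substring divided by the length of the shorter sequence), the stopping threshold and all
function names are hypothetical choices, since the text only specifies a standard
string-comparison algorithm and UPGMA clustering.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Longest common substring length, scaled by the shorter sequence."""
    if not a or not b:
        return 0.0
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    match = matcher.find_longest_match(0, len(a), 0, len(b))
    return match.size / min(len(a), len(b))


def upgma_clusters(seqs, min_similarity=0.6):
    """Average-linkage (UPGMA) agglomeration, stopped below min_similarity."""
    clusters = [[i] for i in range(len(seqs))]
    sim = {(i, j): similarity(seqs[i], seqs[j])
           for i in range(len(seqs)) for j in range(i + 1, len(seqs))}

    def average(c1, c2):
        # UPGMA: unweighted mean over all member pairs of the two clusters.
        pairs = [(min(i, j), max(i, j)) for i in c1 for j in c2]
        return sum(sim[p] for p in pairs) / len(pairs)

    while len(clusters) > 1:
        best, x, y = max(
            (average(clusters[x], clusters[y]), x, y)
            for x in range(len(clusters))
            for y in range(x + 1, len(clusters)))
        if best < min_similarity:
            break  # remaining clusters are well separated
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(seqs[i] for i in c) for c in clusters]
```

With this score, truncation variants of one epitope reach a similarity of 1.0 and end up in
the same cluster, while unrelated sequences stay apart.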

Overall, the data evaluation software provided the ability to perform swiftly and 
reproducibly the following: 

- Select from the sequence output generated by SEQUEST those sequences that satisfy
reliable empirical criteria.

- Store the data in a database appropriately designed for the discovery process at
hand.

- Extract information about the sequence content of each stored dataset. This
information is valuable in assessing the importance of individual sequences within the
given dataset and, consequently, across multiple datasets.

- Provide, by virtue of the multiple dataset comparisons, a tool that realizes the
differential approach, namely the study of the actual sequence content of one sample
versus other(s).

Example 1

In this example, the technique mentioned in Figure 1 was used to identify novel 
HLA-DR-associated peptide markers derived from serum and synovial fluid of patients 
with non-erosive RA. 

6 x 10^6 immature dendritic cells were pulsed with either 1 ml serum (3 samples) or
0.6 ml synovial fluid (2 samples) of patients with non-erosive RA and cultured for 24 hrs in
the presence of 10 ng/ml TNFα. As a control, 6 x 10^6 dendritic cells were cultured in the
presence of TNFα (10 ng/ml) with 1 ml of PBS added instead of serum. In an additional
experiment 6 x 10^6 dendritic cells were pulsed with 1 ml serum from 2 healthy test persons
and cultured for 24 hrs in the presence of TNFα (10 ng/ml).

Dendritic cells were lysed in detergent TX-100 and HLA-DR molecules were isolated 
using mAb L243. HLA-DR-associated peptides were eluted with 0.1% TFA and analyzed by 
high-throughput 2D-LC-MS/MS technology. Peptide identification was achieved by using 
the SEQUEST algorithm. The peptide sequences derived from the SEQUEST analysis and 
accompanying information on mass accuracy, scoring parameters and peptide origin were
stored in a database and further processed. 

The peptide sequences identified from unpulsed DCs (control 1) and from DCs 
pulsed with the serum of healthy test persons (control 2) were compared with the peptide 
sequences identified from DCs pulsed with the serum of non-erosive RA patients. Among 
the RA-specific sequences, only those peptides were selected for further evaluation that
re-occurred in at least three of five non-erosive RA samples.
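
The selection rule of this paragraph amounts to a set difference followed by a recurrence
count. The Python sketch below illustrates it under simplifying assumptions: each sample is
represented as a collection of peptide sequence strings, sequences are compared by exact
identity (length variants are not merged), and the function name is hypothetical.

```python
from collections import Counter


def select_ra_specific(ra_samples, control_samples, min_recurrence=3):
    """Return sequences absent from all controls that recur in RA samples.

    ra_samples and control_samples are lists of collections of peptide
    sequences, one collection per sample.
    """
    control = set().union(*control_samples)
    counts = Counter(
        seq for sample in ra_samples for seq in set(sample) - control)
    return sorted(seq for seq, n in counts.items() if n >= min_recurrence)
```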

In each serum sample roughly 600±150 individual peptide sequences (cross
correlation coefficient CC > 3.0 and ΔCC > 0.15) were identified. In the synovia samples
the number of individual peptide sequences was slightly smaller (400±30). Approximately
80-85% of the peptides found in RA samples were also identified in control samples,
underlining the high reproducibility of the analysis. In the majority of cases, several length
variants of the same epitope could be identified which is a typical characteristic of class II
MHC-bound antigens and supports the validity of the results (Jones, E.Y., Curr Opin
Immunol 9 (1997) 75-79). Further confidence in the quality of the data relies on the fact
that several of the identified peptides or proteins have already been described in the
context of MHC class II molecules: epitopes derived from ubiquitous proteins like Hsp70,
enolase, annexin II, cathepsin C or collagen II, as well as from MHC molecules (HLA-A, -B,
-C, -E, -G, and β2-microglobulin) and CLIP (Chicz, R.M. et al., J Exp Med 178 (1993)
27-47; Sinigaglia, F. & Hammer, J., Curr Opin Immunol 6 (1994) 52-56; Arnold-Schild, D.
et al., J Immunol 162 (1999) 3757-3760; Vogt, A.B. & Kropshofer, H., Trends
Biochem Sci 4 (1999) 150-154) were frequently detected.

RA-specific peptide sequences were further validated with regard to binding to the 
RA susceptibility allele DRB1*0401 by using the TEPITOPE software (Hammer, J. et al.,
Adv Immunol 66 (1997) 67-100). This software provides means for the qualitative and
quantitative prediction of T cell epitopes. 

The output of the study consists of an epitope that occurred, apart from one 
exception, only in non-erosive RA samples (Table 1). 



Interferon-gamma-inducible lysosomal thiol reductase

A very interesting epitope which was identified in 3 out of 5 non-erosive RA samples 
from serum and synovia is derived from the interferon-gamma-inducible lysosomal thiol 
reductase (GILT): the 16-mer GILT (192-207) with the amino acid sequence of SEQ ID 
NO: 3 (Table 1). Further length variants in three other samples support the relevance of 
this epitope (Table 1): the 14-mer GILT (192-205; SEQ ID NO: 1) and the 17-mer GILT
(192-208; SEQ ID NO: 2). 

As judged from the shortest length variant, GILT (192-205), the epitope contains a
suitable binding motif with regard to binding to the RA susceptibility allele DRB1*0401:
196M serves as a P1 anchor, 199M as a P4 anchor and 201A as a P6 anchor. According to
TEPITOPE scoring, the epitope has a binding score (threshold value) of 1%, which is
similar to the binding score of an epitope from influenza haemagglutinin (307-319) that
was shown to be a strong DRB1*0401 binder (Table 1) (Rothbard, J.B. et al., Cell 52 (1988)
515-523).
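
The anchor notation used here (for example 196M as P1) combines a protein-level residue number with the one-letter amino acid code. The following minimal sketch, assuming a hypothetical helper `anchor_residues` and the one-letter translation of SEQ ID NO: 1, shows how the P1, P4 and P6 residues follow from the start position of the peptide and the position of the P1 anchor. It does not reproduce TEPITOPE scoring.

```python
def anchor_residues(sequence, start, p1_position, pockets=(1, 4, 6)):
    """Map pocket labels (P1, P4, P6) to residues in protein-level numbering.

    sequence     one-letter peptide sequence
    start        protein position of the first residue of the peptide
    p1_position  protein position of the residue occupying pocket P1
    """
    anchors = {}
    for pocket in pockets:
        position = p1_position + (pocket - 1)  # P4 lies 3 residues after P1
        index = position - start               # index within the peptide
        if not 0 <= index < len(sequence):
            raise ValueError(f"P{pocket} falls outside the peptide")
        anchors[f"P{pocket}"] = f"{position}{sequence[index]}"
    return anchors


# GILT (192-205), SEQ ID NO: 1, written in one-letter code.
gilt = "GDRGMQLMHANAQR"
print(anchor_residues(gilt, start=192, p1_position=196))
# {'P1': '196M', 'P4': '199M', 'P6': '201A'}
```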

GILT is constitutively expressed in antigen-presenting cells, such as dendritic cells,
macrophages and B cells, and facilitates unfolding of endocytosed antigens in MHC class
II-containing compartments (MIIC) by enzymatically reducing disulfide bonds (Phan,
U.T. et al., J Biol Chem 275 (2000) 25907-25914). Direct binding of GILT to HLA-DR
molecules has been reported for B cells (Arunachalam, B. et al., J Immunol 160 (1998)
5797-5806). A rather long second epitope of GILT was found to bind to HLA-DR3
molecules: the 22-mer GILT (38-59) having the amino acid sequence
SPLQALDFFGNGPPVNYKTGNL (Chicz, R.M. et al., J Exp Med 178 (1993) 27-47).

In addition to GILT (192-207), another epitope of the same protein was identified in 
several RA samples, but also in control samples: GILT (210-227) with the amino acid 
sequence QPPHEYVPWVTVNGKPLE. This epitope was accompanied by 3 other length
variants: the 16-mer GILT (210-225), the 17-mer GILT (210-226) and the 19-mer 
GILT (210-228). 

As indicated by its name, GILT expression can be induced by the pro-inflammatory
cytokine interferon gamma (IFN-γ) in various types of cells, including macrophages,
endothelial cells and fibroblasts (Luster, A.D. et al., J Biol Chem 263 (1988) 12036-12043).
As IFN-γ is known to be present in inflamed joints of RA patients, GILT could become
over-expressed in synovia and serum and, hence, could be taken up by DCs as an
exogenous antigen. GILT (192-207) may be derived from exogenous GILT. The other
GILT epitope, which is also present in the control samples, may be derived from
endogenous GILT, expressed by DCs. Alternatively, both GILT (192-207) and GILT
(210-227) may be derived from endogenous GILT, if GILT processing and GILT-derived
epitope presentation by DCs were critically altered upon contact with RA-associated
material.


Example 2 

In this example, the same technology was used that has been described in detail in 
example 1. Serum (4 samples) and synovial fluid (2 samples) of patients with diagnosed 
erosive RA were utilized in this case to identify candidate markers specific for erosive RA. 

The peptide sequences found in the erosive RA samples were compared with the
sequences identified in unpulsed DCs (control 1) and in DCs pulsed with the serum of 
healthy test persons (control 2). Among the RA-specific sequences, only those peptides 
were selected for further evaluation that re-occurred in at least three of six erosive RA 
samples. 

In this study one epitope was discovered which occurred, apart from one exception,
only in erosive RA samples. 



Apolipoprotein B-100 

The epitope which was mainly found in erosive RA sera (4 out of 6 erosive RA
samples) is derived from apolipoprotein B-100: the 16-mer ApoB (2877-2892) with the
amino acid sequence of SEQ ID NO: 4 (Table 2). In addition, a length variant of the same
epitope was identified (Table 2): the 17-mer ApoB (2877-2893; SEQ ID NO: 5). The
following DRB1*0401 binding motif can be predicted: 2881L as a P1 anchor, 2884D as a
P4 anchor and 2886N as a P6 anchor (binding score 3%).

In an earlier study on EBV-B cells, the epitope ApoB (2885-2900), which partly 
overlaps with the epitope described here, has been found in the context of HLA-DR4 
(Chicz, R.M. et al., J Exp Med 178 (1993) 27-47).

Apolipoprotein B-100 is a constituent of very low-density lipoproteins (VLDL) and
low-density lipoproteins (LDL) and functions as a recognition signal for the cellular
binding and internalization of LDL particles by the ApoB/E receptor (Yang, C.Y. et al.,
Nature 323 (1986) 738-742). Interestingly, an increased ratio of LDL cholesterol to HDL
cholesterol was observed among newly diagnosed RA patients (Park, Y.B. et al., J
Rheumatol 26 (1999) 1701-1704). The adverse lipid profile in active RA could be improved
by treating RA patients with DMARDs without the use of lipid-lowering agents (Park, Y.B.
et al., Am J Med 113 (2002) 188-193). Since an increased cardiovascular mortality among
patients with chronic inflammatory diseases, such as RA, is well documented (Symmons,
D.P. et al., J Rheumatol 25 (1998) 1072-1077), it was suggested that local inflammation in
RA leads to altered blood lipid levels, thereby increasing the risk of atherosclerosis. The
question whether components of the lipoprotein metabolism are causal for pathogenesis or
merely affected by ongoing immune reactions during RA development cannot be answered
yet. However, the observation of adverse lipid profiles in RA patients supports the validity
of the presented ApoB epitope as a serum-derived RA candidate marker.

The length variant ApoB (2877-2892), but not ApoB (2877-2893), has been 
identified in samples of two healthy controls (Table 2). Since Apolipoprotein B constitutes 
1% of all plasma proteins, the presence of ApoB epitopes in healthy control samples is not 
surprising. The results suggest that only the length variant ApoB (2877-2893; SEQ ID NO:
5) is specific for erosive RA.



Example 3 

All peptide sequences identified in examples 1 and 2 from non-erosive and erosive 
RA samples were used in this example to search for common markers relevant for both RA 
types. The RA-specific sequences were again compared with peptide sequences of the
control samples (unpulsed DCs and DCs pulsed with the serum of two healthy test 
persons) and only those peptides were selected for further evaluation that re-occurred in at 
least three of altogether eleven RA samples (erosive and non-erosive RA). 
Inter-alpha-trypsin inhibitor

Six out of seven serum samples (erosive & non-erosive RA) but not any of the 
controls gave rise to an epitope derived from the heavy chain H4 of the inter-alpha-trypsin 
inhibitor: ITIH4 (271-287) with the amino acid sequence of SEQ ID NO: 8 (Table 3). 
Apart from this major length variant of the ITIH4 epitope, six length variants of the same 
ITIH4 epitope could be identified (Table 3): the 19-mer ITIH4 (271-289; SEQ ID NO: 6), 
the 18-mer ITIH4 (271-288; SEQ ID NO: 7), the 16-mer ITIH4 (274-289; SEQ ID NO: 12), 
the 15-mer ITIH4 (273-287; SEQ ID NO: 10), the 15-mer ITIH4 (274-288; SEQ ID NO: 11) and
the 14-mer ITIH4 (274-287; SEQ ID NO: 9). 
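
The nested length variants listed above all contain one common stretch of residues. As a small illustrative sketch (the helper name `shared_core` and the dictionary layout are assumptions made for this example), the shared range can be computed from the start and end positions given in the text and then cut out of the longest variant, SEQ ID NO: 6, written in one-letter code.

```python
def shared_core(variants):
    """Return the (start, end) range common to all length variants, or None."""
    start = max(s for s, _ in variants.values())
    end = min(e for _, e in variants.values())
    return (start, end) if start <= end else None


# ITIH4 length variants as listed in the text: SEQ ID NO -> (first, last residue).
itih4 = {
    6: (271, 289), 7: (271, 288), 8: (271, 287), 9: (274, 287),
    10: (273, 287), 11: (274, 288), 12: (274, 289),
}
core = shared_core(itih4)
print(core)  # (274, 287)

# Cut the shared range out of the longest variant (SEQ ID NO: 6).
longest = "MPKNVVFVIDKSGSMSGRK"
offset = itih4[6][0]
core_sequence = longest[core[0] - offset: core[1] - offset + 1]
print(core_sequence)  # NVVFVIDKSGSMSG
```

Under these positions the shared range coincides with the shortest variant, ITIH4 (274-287).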

As judged from the shortest length variant, ITIH4 (274-287), the epitope contains a 
very strong binding motif with regard to binding to the RA susceptibility allele
DRB1*0401: 277F serves as a P1 anchor, 280D as a P4 anchor and 282S as a P6 anchor
(binding score: 1%).

ITIH4 belongs to the inter-alpha-inhibitor (IαI) family, which is a group of serum
protease inhibitors that bind to hyaluronic acid (HA) and appear to be involved in acute- 
phase reactions (Salier, J.P. et al., Biochemical Journal 315 (1996) 1-9). 

HA is a polysaccharide found in all tissues of the body, in particular, in loose 
connective tissue, e.g. joint fluid (Evered, D. & Whelan, J. eds., The Biology of Hyaluronan,
John Wiley & Sons (1989)). HA has an important structural function in cartilage and other
tissues where it stabilizes the extracellular matrix by forming aggregates with 
proteoglycans. It has also been assigned important biological functions by regulating 
cellular activities via binding to cell surface proteins, such as CD44 and ICAM-1 
(Knudson, C.B. & Knudson, W., FASEB J 7 (1993) 1233-1241; Hall, C.L. et al., J Cell Biol
126 (1994) 575-588). RA is accompanied by a large increase in total HA in the joint fluid as 
well as in the serum, suggesting that circulating HA originates from rheumatoid joints 
(Engstrom-Laurent, A. et al., Scand J Clin Lab Invest 45 (1985) 497-504). 

Complexes of HA and some IαI family members were observed in large amounts in
the synovial fluid of RA patients (Jessen, T.E. et al., Biological Chemistry Hoppe-Seyler 375
(1994) 521-526). The role of the IαI-HA complex in inflammatory reactions might be to
modify the CD44-HA interaction that mediates leukocyte activation and invasion (Isacke,
C.M. & Yarwood, H., Int J Biochem Cell Biol 34 (2002) 718-721). Additionally, synovial
fluid of RA patients contains elevated levels of TSG-6, an anti-inflammatory glycoprotein
and a member of the hyaladherin family of HA-binding proteins (Wisniewski, H.G. et al., J
Immunol 151 (1993) 6593-6601). It has been shown that a complex of TSG-6 with IαI
family members inhibits the activity of plasmin, a central molecule in the activation of
inflammation-associated enzymes (Wisniewski, H.G. et al., J Immunol 156
(1996) 1609-1615). A regulation of plasmin activity by several acute-phase plasma
proteins, namely TSG-6 and IαI family members, may prove to be important in RA, given
the high contents of HA, TSG-6 and IαI family members in synovial fluid of inflamed
joints.

This evidence, together with the identification of multiple length variants of the same 
epitope and a strong HLA-DR4 binding motif, convincingly supports the validity of the
presented ITIH4 epitope as a serum-derived RA candidate marker. 



Complement C4

In all erosive and non-erosive RA sera tested, another dominant epitope was
identified which is derived from complement C4: the 15-mer C4 (1697-1711) with the
amino acid sequence of SEQ ID NO: 13 (Table 3). Five additional length variants of the
same epitope could be found (Table 3): the 12-mer C4 (1697-1708; SEQ ID NO: 18), the
13-mer C4 (1698-1710; SEQ ID NO: 17), the 14-mer C4 (1697-1710; SEQ ID NO: 15), the
16-mer C4 (1697-1712; SEQ ID NO: 14) and the 18-mer C4 (1697-1714; SEQ ID NO: 16).
Moreover, the presented epitope displays a very strong DRB1*0401 binding motif: 1700Y
as P1 anchor, 1704D as P4 anchor and 1706N as P6 anchor (binding score: 1%).

C4, which constitutes approximately 0.5% of plasma protein mass, plays a critical role
in the triggering of the central pathway of the complement system. The protein is
synthesized as a single-chain precursor and, prior to secretion, is enzymatically cleaved to
form a trimer of non-identical α-, β-, and γ-chains. The identified epitope C4 (1697-1711)
is located at the very C-terminus of the C4 γ-chain. The C4 α-chain is further
proteolytically degraded by activated C1 to form the C4a anaphylatoxin, which is a
mediator of local inflammatory processes (Moon, K.E. et al., J Biol Chem 256 (1981) 8685-
8692).

In general, the complement cascade is involved in the induction and progression of 
inflammatory reactions and is a major defense system against various pathogenic agents, 
including bacteria, viruses and other antigens (Morgan, B.P., Methods Mol Biol 150 (2000)
1-13). Inappropriate activation, however, can lead to tissue damage and manifestation of
disease (Speth, C. et al., Wien Klin Wochenschr 111 (1999) 378-391).

Activation of the complement system has been repeatedly implicated in the
pathogenesis of RA, based on studies showing increased levels of complement metabolites,
including C4 and C4a, in plasma, synovial fluid and synovial tissue of RA patients
(Neumann, E. et al., Arthritis Rheum 46 (2002) 934-945). In addition, collagen-induced
arthritis (CIA) in mice is characterized by the presence of
complement activation products (Linton, S.M. & Morgan, B.P., Mol Immunol 36 (1999)
905-914). CIA is prevented after treatment with anti-C5 monoclonal antibodies (Wang, Y.
et al., PNAS 92 (1995) 8955-8959) or with soluble CR1, an inhibitor of the complement
system, delivered by gene therapy (Dreja, H. et al., Arthritis Rheum 43 (2000) 1698-1709).
Activation of complement factors in joints is possibly induced by the presence of various
immune complexes, and it was hypothesized that stimulation of the innate immune system
by infectious agents and cytokines may contribute to the initiation of RA (Friese, M.A. et
al., Clin Exp Immunol 121 (2000) 406-414).

Two of the six presented C4 epitopes, the 15- and the 18-mer, were also identified in 
healthy control samples (Table 3) indicating that only some length variants of this C4 
epitope are RA-specific, namely the antigenic peptides of SEQ ID NOs: 14, 15, 17, and 18. 
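
The step from "the 15- and the 18-mer were also found in controls" to the list of RA-specific SEQ ID NOs is a simple exclusion. As an illustrative check (the variable names are assumptions; the lengths and SEQ ID NOs are those stated in the text):

```python
# C4 length variants named in the text: SEQ ID NO -> peptide length (residues).
c4_variants = {13: 15, 14: 16, 15: 14, 16: 18, 17: 13, 18: 12}

# Lengths of the variants that were also seen in healthy control samples.
control_lengths = {15, 18}

ra_specific = sorted(
    seq_id for seq_id, length in c4_variants.items()
    if length not in control_lengths
)
print(ra_specific)  # [14, 15, 17, 18]
```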



Complement C3

Another epitope that was found in erosive and non-erosive RA samples is derived 
from complement C3: the 14-mer C3 (1431-1444) with the amino acid sequence of SEQ ID 
NO: 21 (Table 3). Four additional length variants of the same epitope were identified in 
serum (Table 3): the 13-mer C3 (1431-1443; SEQ ID NO: 23), the 15-mer C3 (1431-1445;
SEQ ID NO: 22), the 15-mer C3 (1429-1443; SEQ ID NO: 20) and the 19-mer C3 (1426-
1444; SEQ ID NO: 19). As judged from the shortest length variant, C3 (1431-1443), a
DRB1*0401 binding motif can be postulated: 1434Y serves as a P1 anchor, 1437D as a P4
anchor and 1439A as a P6 anchor.

Complement C3, which constitutes about 1-2% of plasma protein mass, plays a
central role in the activation of the complement system and belongs to the family of the
acute-phase proteins. Its processing by C3 convertase to C3a anaphylatoxin and C3b is the
central step in both the classical and alternative complement pathways (Barrington, R. et
al., Immunol Rev 180 (2001) 5-15). After activation, C3b can bind covalently, via a reactive
thiolester, to cell surface carbohydrates or immune aggregates (Isaac, L. & Isenman, D.E., J
Biol Chem 267 (1992) 10062-10069). The identified epitope C3 (1431-1444) is located at
the C-terminus of C3b.

As already discussed in the context of complement epitope C4 (1697-1711), there is 
increasing evidence for an important role of components of the complement cascade in the 
pathophysiology of RA. The result of this study, in which two major epitopes derived from 
complement C3 and C4 were identified in serum of RA patients, underlines the close
link between the activated complement system and pathogenesis of RA. This
coincidence makes a strong argument for the validity of the presented C3 / C4 epitopes as
serum-derived candidate RA markers.


SH3 domain-binding glutamic acid-rich-like protein 3 

Another epitope which was identified quite frequently in serum of RA patients (5
out of 7 erosive and non-erosive RA samples) is derived from the SH3 domain-binding
glutamic acid-rich-like protein 3 (SH3BGRL3): SH3BGRL3 (15-26) with the amino acid
sequence of SEQ ID NO: 25 (Table 3). Three length variants of the same epitope were
identified (Table 3): the 14-mer SH3BGRL3 (13-26; SEQ ID NO: 26), the 14-mer
SH3BGRL3 (15-28; SEQ ID NO: 27) and the 16-mer SH3BGRL3 (13-28; SEQ ID NO: 24).
The DRB1*0401 binding motif is: 17I as P1 anchor, 20Q as P4 anchor and 22S as P6
anchor (binding score 4%).

SH3BGRL3 is a small 10 kD protein that belongs to the SH3BGR family. The precise
function of the protein is unknown but a role as a modulator of glutaredoxin biological
activity is postulated (Mazzocco, M. et al., Biochem Biophys Res Commun 285 (2001) 540-
545). So far, SH3BGRL3 has not been described in the context of RA.

Interestingly, the analysis revealed a second epitope of the same protein, which was
highly abundant in all RA and control samples: the 16-mer SH3BGRL3 (29-44) with the
amino acid sequence DGKRIQYQLVDISQDN. In addition, multiple length variants of the
same epitope were found in most samples as well. As judged from the shortest length
variant, SH3BGRL3 (31-42), the epitope contains DRB1*0401 anchor residues very similar
to those of SH3BGRL3 (15-26): 33I serves as a P1 anchor, 36Q as a P4 anchor
and 38V as a P6 anchor (binding score -2). This similarity is reflected by comparable
binding scores.

The presence of this second SH3BGRL3 epitope supports the validity of the
SH3BGRL3 (15-26) epitope because both peptides are derived from the same protein;
however, only one of them, epitope SH3BGRL3 (15-26), appears to be generated in an
RA-specific manner. A similar observation has been described already for GILT in Example 1.

Among the four SH3BGRL3 length variants the longest variant, SH3BGRL3 (13-28), 
was also identified in a healthy control sample (Table 3). However, this particular length 
variant was found only one time, which indicates a significant enrichment of the 
SH3BGRL3 epitope in the context of RA. 
Interleukin-4 (IL-4) induced protein 1

In all the investigated sera and synovial fluids (erosive & non-erosive RA), one highly
dominant epitope was identified which is derived from the human homolog of the IL-4
induced protein 1 (Fig1): Fig1 (293-309) with the amino acid sequence of SEQ ID NO: 28
(Table 3). The validity of the epitope was further supported by the presence of additional
length variants in several samples (Table 3): the 16-mer Fig1 (293-308; SEQ ID NO: 30)
and the 19-mer Fig1 (293-311; SEQ ID NO: 29). Moreover, the amino acid sequence
displays a typical DRB1*0401 binding motif: 299V serves as P1 anchor, 302E as P4 anchor
and 304S as P6 anchor (binding score 1%).

Two length variants of the same epitope, Fig1 (293-308) and Fig1 (293-309), were
identified in one unpulsed sample and in one healthy control sample as well (Table 3).
However, the presence of the Fig1 epitope in all RA samples but not in all of the control
samples tested strongly indicates an enrichment in the context of RA.

The human fig1 gene was first identified in IL-4-stimulated B cell cultures (Chu, C.C.
& Paul, W.E., PNAS 94 (1997) 2507-2512). The human fig1 resides on chromosome
19q13.3-19q13.4, a region previously identified to be involved in susceptibility to
autoimmune diseases, including SLE, arthritis, multiple sclerosis, and insulin-dependent
diabetes mellitus (Becker, K.G. et al., PNAS 95 (1998) 9979-9984). Since its expression is
largely limited to immune tissues and its regulation is dependent on IL-4, a key modulator
of the immune response, fig1 is an attractive candidate gene for autoimmune disease
susceptibility (Chavan, S.S. et al., Biochim Biophys Acta 1576 (2002) 70-80). The HLA-
DR4-restricted presentation of a Fig1 epitope provides the first indication that Fig1 protein
is produced and possibly involved in the disease development of RA. The Fig1 polypeptide
has not been known as a marker for RA until now, and is considered an important
candidate marker for RA.



Hemopexin 

Another RA candidate marker which was frequently identified in serum samples (6
out of 7 samples) and in synovia samples (2 out of 4 samples) (erosive & non-erosive RA)
is derived from hemopexin (HPX): HPX (351-367) with the amino acid sequence of SEQ
ID NO: 32 (Table 3). Several length variants were found which support the validity of this
epitope (Table 3): the 13-mer HPX (351-363; SEQ ID NO: 33), the 14-mer HPX (350-363;
SEQ ID NO: 34), the 15-mer HPX (351-365; SEQ ID NO: 35) and the 18-mer HPX (351-
368; SEQ ID NO: 31). Furthermore, the epitope contains a very strong DRB1*0401 binding
motif: 355I serves as a P1 anchor, 358D as a P4 anchor and 360V as a P6 anchor
(binding score: 1%).

Two length variants of the same epitope, HPX (351-367; SEQ ID NO: 32) and HPX 
(351-365; SEQ ID NO: 35), could also be identified in healthy control samples (Table 3) 
indicating that only some length variants are specific for RA, namely the antigenic peptides
of SEQ ID NOs. 31, 33 and 34. 

HPX is a 60 kD plasma glycoprotein with a high binding affinity to heme (Muller-
Eberhard, U., Methods Enzymol 163 (1988) 536-565). It is mainly expressed in the liver
and belongs to the acute-phase proteins, the synthesis of which is induced in an
inflammatory situation. RA is a chronic inflammatory autoimmune disease and elevated
levels of several acute-phase proteins, including C-reactive protein and serum amyloid A,
have been reported (Nakamura, R., J Clin Lab Anal 14 (2000) 305-313). HPX is responsive
to the cytokines IL-1 and IL-6, which are upregulated in patients suffering from RA
(Feldmann, M. & Maini, R.N., Rheumatology 38, Suppl 2 (1999) 3-7).

HPX is the major vehicle for the transportation of heme in the plasma and its
principal role is to prevent heme-mediated oxidative stress and loss of heme-bound iron
(Tolosano, E. & Altruda, F., DNA Cell Biol 21 (2002) 297-306). It can protect cells against
oxidative stress by inducing the expression of intracellular antioxidants such as heme
oxygenase, metallothioneins and ferritin. Metallothioneins are cytosolic proteins that are
expressed particularly in synovial fibroblasts (Backman, J.T. et al., Virchows Arch 433
(1998) 153-160). There is significant experimental evidence for the presence of oxidative
stress in the synovial tissue of RA patients (reviewed in: Schett, G. et al., Arthritis Res 3
(2001) 80-86). Furthermore, HPX was reported to promote proliferation of human T
lymphocytes (Smith, A. et al., Exp Cell Res 232 (1997) 246-254). These studies render it
likely that HPX belongs to the up-regulated proteins in serum and synovia of RA patients,
thereby providing a rationale for the relevance of HPX (351-367) as an RA-specific
candidate marker.



Hsc70-interacting protein 

An epitope which was mostly identified in serum samples (4 out of 7 erosive and
non-erosive RA samples) and which is also related to stress responses is derived from the
Hsc70-interacting protein Hip: Hip (83-98) with the amino acid sequence of SEQ ID NO:
38 (Table 3). Two length variants of this epitope were identified (Table 3): the 18-mer Hip
(83-100; SEQ ID NO: 36) and the 15-mer Hip (84-98; SEQ ID NO: 39). An additional
length variant was discovered in one erosive synovia sample (Table 3): the 15-mer Hip
(85-99; SEQ ID NO: 37). As judged from the shortest length variant, Hip (84-98), a
DRB1*0401 binding motif attaining a moderate score of 8% can be postulated: 89I as P1
anchor, 92D as P4 anchor, 94D as P6 anchor.

In the cytosol of eukaryotic cells, Hip and Hop proteins associate with Hsc70 in
order to participate in the regulation of Hsc70 chaperone activity (Frydman, J. & Hohfeld,
J., Trends Biochem Sci 22 (1997) 87-92). The 42 kD Hip protein binds to the ATPase
domain of Hsc70. It was postulated that Hip might increase the half-life of the chaperone-
substrate complex, providing the molecular basis for an efficient cooperation of Hsc70 with
downstream chaperone systems. Hsc70 and Hsp90 have been shown to cooperate during
protein folding in vitro (Jakob, U. & Buchner, J., Trends Biochem Sci 19 (1994) 205-211;
Freeman, B.C. & Morimoto, R.I., EMBO J 15 (1996) 2969-2979) and to play a role in
thermal denaturation (Schneider, C. et al., PNAS 93 (1996) 14536-14541). The Hsc70 and
Hsp90 association with stress-adaptation ultimately links Hip to stress responses, including
the induction of heat shock proteins, in the synovial tissue of RA patients (reviewed in:
Schett, G. et al., Arthritis Res 3 (2001) 80-86).
Claims

1. A MHC class II antigenic peptide comprising

(a) at least the amino acid sequence of the peptide binding motif selected from the
group consisting of SEQ ID NOs. 49 to 57, or

(b) at least the amino acid sequence of the peptide binding motif selected from the
group consisting of SEQ ID NOs. 49 to 57 with additional N- and C-terminal
flanking sequences of a corresponding sequence selected from the group consisting
of SEQ ID NOs. 1 to 39.

2. A MHC class II antigenic peptide comprising

(a) at least the amino acid sequence of the peptide binding motif of SEQ ID NO.
49, or

(b) at least the amino acid sequence of the peptide binding motif of SEQ ID NO.
49 with additional N- and C-terminal flanking sequences of a corresponding
sequence selected from the group consisting of SEQ ID NOs. 1 to 3.

3. A MHC class II antigenic peptide comprising

(a) at least the amino acid sequence of the peptide binding motif of SEQ ID NO.
50, or

(b) at least the amino acid sequence of the peptide binding motif of SEQ ID NO.
50 with additional N- and C-terminal flanking sequences of the corresponding
sequence of SEQ ID NO. 5.

4. The MHC class II antigenic peptide according to any one of claims 1 to 3 linked to a
MHC class II molecule.

5. An antibody reactive with a MHC class II antigenic peptide according to any one of
claims 1 to 3.

6. A nucleic acid molecule encoding a peptide or polypeptide according to any one of
claims 1 to 4.

7. A recombinant nucleic acid construct comprising the nucleic acid molecule
according to claim 6 operably linked to an expression vector.

8. A host cell containing the nucleic acid construct according to claim 7.

9. A method for producing a MHC class II antigenic peptide according to any one of
claims 1 to 3 comprising the steps of culturing the host cell of claim 8 under
conditions allowing expression of said peptide and recovering the
peptide from the cells or the culture medium.



10. A method for isolating and identifying MHC class II associated RA antigenic 
peptides in femtomolar amounts, which method comprises 

(a) providing immature dendritic cells in a number comprising 0.1 to 5 ng MHC 
class II molecules; 

(b) contacting the cells of (a) with serum or synovial fluid and inducing maturation 
of dendritic cells by adding TNFalpha; 

(c) isolating class II MHC molecule-antigenic peptide complexes from the cells 
with methods comprising solubilization of the cells and sequestration of the 
complexes of MHC class II molecules with antigenic peptides by 
immunoprecipitation or immunoaffinity chromatography, 

(d) washing the sequestered complexes of MHC class II molecules with antigenic 
peptides with water in an ultrafiltration tube; 

(e) eluting the associated antigenic peptides from the MHC class II molecules at 
37°C with diluted trifluoroacetic acid, and

(f) separating, detecting and identifying the isolated peptides by liquid 
chromatography and mass spectrometry. 

11. The method according to claim 10, wherein in step (f) of the method the liquid 
chromatography comprises a first linear elution step from the reversed-phase 
material with a volume sufficient to elute contaminants prior to the peptide elution 
step. 

12. The method according to any one of claims 10 and 11, further comprising 

(g) analyzing the identified peptides by methods comprising a database and a 
software developed to perform comparative data analysis across multiple datasets. 

13. A pharmaceutical composition comprising a MHC class II antigenic peptide 
according to any one of claims 1 to 3, an antibody according to claim 5, or a 
polypeptide selected from the group consisting of SEQ ID NOs 40 to 48, and 
optionally a pharmaceutically acceptable carrier.

14. A diagnostic composition comprising the antibody according to claim 5. 

15. The use of the MHC class II antigenic peptide according to claim 1, wherein the 
antigenic peptide is a marker for erosive and/or non-erosive RA. 

16. The use of the MHC class II antigenic peptide according to claim 2, wherein the 
antigenic peptide is a marker for non-erosive RA.

17. The use of the MHC class II antigenic peptide according to claim 3,
wherein the antigenic peptide is a marker for erosive RA.

18. The use of a polypeptide selected from the group consisting of SEQ ID NOs 40 to 
48 as a marker for RA, preferably for erosive and/or non-erosive RA. 

19. The antigenic peptides, antibodies, nucleic acids, host cells, methods, compositions
and uses substantially as herein before described especially with reference to the 
foregoing Examples. 
Abstract

The present invention provides novel naturally-processed MHC class II antigenic
peptides which originate from interferon-γ-inducible lysosomal thiol reductase,
apolipoprotein B-100, inter-α-trypsin inhibitor heavy chain H4, complement C4,
complement C3, SH3 domain-binding glutamic acid-rich-like protein 3, interleukin-4-
induced protein 1, hemopexin, and Hsc70-interacting protein. Also provided are these
antigenic peptides and the proteins they are derived from as markers for erosive and/or
non-erosive RA. Moreover, these antigenic peptides linked to MHC class II molecules,
antibodies reactive with said antigenic peptides, nucleic acids encoding said antigenic
peptides, and nucleic acid constructs, host cells and methods for expressing said antigenic
peptides are provided. The antigenic peptides of the invention can be used as markers in
diagnosis of RA and in therapy as anti-RA vaccines.
<120> RA antigenic peptides
<130> 21796 
<160> 57 

<170> PatentIn version 3.2



<210> 1 

<211> 14 

<212> PRT

<213> Homo sapiens 



<400> 1 

Gly Asp Arg Gly Met Gln Leu Met His Ala Asn Ala Gln Arg



<210> 2

<211> 17

<212> PRT

<213> Homo sapiens

<400> 2

Gly Asp Arg Gly Met Gln Leu Met His Ala Asn Ala Gln Arg Thr Asp

Ala


<210> 3

<211> 16

<212> PRT

<213> Homo sapiens

<400> 3

Gly Asp Arg Gly Met Gln Leu Met His Ala Asn Ala Gln Arg Thr Asp
1               5                   10                  15



<210> 4 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Ile Asn Asn Gln Leu Thr Leu Asp Ser Asn Thr Lys Tyr Phe His Lys 
1 5 10 15 

<210> 5 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Ile Asn Asn Gln Leu Thr Leu Asp Ser Asn Thr Lys Tyr Phe His Lys 
1 5 10 15 

Leu 

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<210> 6 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Pro Lys Asn Val Val Phe Val Ile Asp Lys Ser Gly Ser Met Ser 
1 5 10 15 

Gly Arg Lys 

<210> 7 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Met Pro Lys Asn Val Val Phe Val Ile Asp Lys Ser Gly Ser Met Ser 
1 5 10 15 

Gly Arg 

<210> 8 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Pro Lys Asn Val Val Phe Val Ile Asp Lys Ser Gly Ser Met Ser 
1 5 10 15 

Gly 

<210> 9 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Asn Val Val Phe Val Ile Asp Lys Ser Gly Ser Met Ser Gly 
1 5 10 

<210> 10 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Lys Asn Val Val Phe Val Ile Asp Lys Ser Gly Ser Met Ser Gly 
1 5 10 15 

<210> 11 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 11 

Asn Val Val Phe Val Ile Asp Lys Ser Gly Ser Met Ser Gly Arg 
1 5 10 15 

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<210> 12 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Asn Val Val Phe Val Ile Asp Lys Ser Gly Ser Met Ser Gly Arg Lys 
1 5 10 15 

<210> 13 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 13 

Gly His Pro Gln Tyr Leu Leu Asp Ser Asn Ser Trp Ile Glu Glu 
1 5 10 15 

<210> 14 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Gly His Pro Gln Tyr Leu Leu Asp Ser Asn Ser Trp Ile Glu Glu Met 
1 5 10 15 

<210> 15 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 15 

Gly His Pro Gln Tyr Leu Leu Asp Ser Asn Ser Trp Ile Glu 
1 5 10 



<210> 16 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Gly His Pro Gln Tyr Leu Leu Asp Ser Asn Ser Trp Ile Glu Glu Met 
1 5 10 15 

Pro Ser 

<210> 17 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 17 

His Pro Gln Tyr Leu Leu Asp Ser Asn Ser Trp Ile Glu 
1 5 10 

<210> 18 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Gly His Pro Gln Tyr Leu Leu Asp Ser Asn Ser Trp 
1 5 10 

<210> 19 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Gly Val Asp Arg Tyr Ile Ser Lys Tyr Glu Leu Asp Lys Ala Phe Ser 
1 5 10 15 

Asp Arg Asn 

<210> 20 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Arg Tyr Ile Ser Lys Tyr Glu Leu Asp Lys Ala Phe Ser Asp Arg 
1 5 10 15 

<210> 21 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 21 

Ile Ser Lys Tyr Glu Leu Asp Lys Ala Phe Ser Asp Arg Asn 
1 5 10 

<210> 22 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 22 

Ile Ser Lys Tyr Glu Leu Asp Lys Ala Phe Ser Asp Arg Asn Thr 
1 5 10 15 

<210> 23 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 23 

Ile Ser Lys Tyr Glu Leu Asp Lys Ala Phe Ser Asp Arg 
1 5 10 

<210> 24 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Gly Ser Arg Glu Ile Lys Ser Gln Gln Ser Glu Val Thr Arg Ile Leu 
1 5 10 15 

<210> 25 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 25 

Arg Glu Ile Lys Ser Gln Gln Ser Glu Val Thr Arg 
1 5 10 






<210> 26 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 26 

Gly Ser Arg Glu Ile Lys Ser Gln Gln Ser Glu Val Thr Arg 
1 5 10 


<210> 27 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 27 

Arg Glu Ile Lys Ser Gln Gln Ser Glu Val Thr Arg Ile Leu 
1 5 10 


<210> 28 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 28 

Gly Pro His Asp Val His Val Gln Ile Glu Thr Ser Pro Pro Ala Arg 
1 5 10 15 

Asn 




<210> 29 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 29 

Gly Pro His Asp Val His Val Gln Ile Glu Thr Ser Pro Pro Ala Arg 
1 5 10 15 

Asn Leu Lys 



<210> 30 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Gly Pro His Asp Val His Val Gln Ile Glu Thr Ser Pro Pro Ala Arg 
1 5 10 15 

<210> 31 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 31 

Thr Pro His Gly Ile Ile Leu Asp Ser Val Asp Ala Ala Phe Ile Cys 
1 5 10 15 

Pro Gly 



<210> 32 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 32 

Thr Pro His Gly Ile Ile Leu Asp Ser Val Asp Ala Ala Phe Ile Cys 
1 5 10 15 

Pro 

<210> 33 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Thr Pro His Gly Ile Ile Leu Asp Ser Val Asp Ala Ala 
1 5 10 

<210> 34 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Gly Thr Pro His Gly Ile Ile Leu Asp Ser Val Asp Ala Ala 
1 5 10 

<210> 35 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 35 

Thr Pro His Gly Ile Ile Leu Asp Ser Val Asp Ala Ala Phe Ile 
1 5 10 15 

<210> 36 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 36 

Ile Asp Lys Glu Gly Val Ile Glu Pro Asp Thr Asp Ala Pro Gln Glu 
1 5 10 15 

Met Gly 



<210> 37 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 37 

Lys Glu Gly Val Ile Glu Pro Asp Thr Asp Ala Pro Gln Glu Met 
1 5 10 15 

<210> 38 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 38 

Ile Asp Lys Glu Gly Val Ile Glu Pro Asp Thr Asp Ala Pro Gln Glu 
1 5 10 15 

<210> 39 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 39 

Asp Lys Glu Gly Val Ile Glu Pro Asp Thr Asp Ala Pro Gln Glu 
1 5 10 15 



<210> 40 

<211> 261 

<212> PRT 

<213> Homo sapiens 

<300> 

<308> Swiss-Prot/P13284 

<309> 1990-01-01 

<313> (1)..(261) 

<400> 40 



Met Asp Ser Arg His Thr Phe Ala Pro Ala Ala Met Thr Leu Ser Pro 
1 5 10 15 

Leu Leu Leu Phe Leu Pro Pro Leu Leu Leu Leu Leu Asp Val Pro Thr 
20 25 30 

Ala Ala Val Gln Ala Ser Pro Leu Gln Ala Leu Asp Phe Phe Gly Asn 
35 40 45 

55 fS° Pr ° Val ASn *** £ S Thr G1 V **n Leu Tyr Leu Arg Gly Pro 

Leu Lys Lys Ser Asn Ala Pro Leu Val Asn Val Thr Leu Tyr Tyr Glu 
65 70 75 80 

Ala Leu Cys Gly Gly Cys Arg Ala Phe Leu Ile Arg Glu Leu Phe Pro 
85 90 95 

Thr Trp Leu Leu Val Met Glu Ile Leu Asn Val Thr Leu Val Pro Tyr 
100 105 110 

Gly Asn Ala Gln Glu Gln Asn Val Ser Gly Arg Trp Glu Phe Lys Cys 
115 120 125 

75 Gln i3o Gly Glu Glu Glu Cy f Lys phe Asn Val Glu Ala cys Val 

135 140 



Leu Asp Glu Leu Asp Met Glu Leu Ala Phe Leu Thr Ile Val Cys Met 
145 150 155 160 

Glu Glu Phe Glu Asp Met Glu Arg Ser Leu Pro Leu Cys Leu Gln Leu 
165 170 175 

Tyr Ala Pro Gly Leu Ser Pro Asp Thr Ile Met Glu Cys Ala Met Gly 
180 185 190 

Asp Arg Gly Met Gln Leu Met His Ala Asn Ala Gln Arg Thr Asp Ala 
195 200 205 

Leu Gln Pro Pro His Glu Tyr Val Pro Trp Val Thr Val Asn Gly Lys 
210 215 220 

Pro Leu Glu Asp Gln Thr Gln Leu Leu Thr Leu Val Cys Gln Leu Tyr 
225 230 235 240 

Gln Gly Lys Lys Pro Asp Val Cys Pro Ser Ser Thr Ser Ser Leu Arg 
245 250 255 

Ser Val Cys Phe Lys 
260 

<210> 41 

<211> 4563 

<212> PRT 

<213> Homo sapiens 

<300> 

<308> Swiss-Prot/P04114 

<309> 1986-11-01 

<313> (1)..(4563) 

<400> 41 

Met Asp Pro Pro Arg Pro Ala Leu Leu Ala Leu Leu Ala Leu Pro Ala 
1 5 10 15 

Leu Leu Leu Leu Leu Leu Ala Gly Ala Arg Ala Glu Glu Glu Met Leu 
20 25 30 

Glu Asn Val Ser Leu Val Cys Pro Lys Asp Ala Thr Arg Phe Lys His 
35 40 45 

Leu Arg Lys Tyr Thr Tyr Asn Tyr Glu Ala Glu Ser Ser Ser Gly Val 
50 55 60 

Pro Gly Thr Ala Asp Ser Arg Ser Ala Thr Arg Ile Asn Cys Lys Val 
65 70 75 80 

Glu Leu Glu Val Pro Gln Leu Cys Ser Phe Ile Leu Lys Thr Ser Gln 
85 90 95 

Cys Thr Leu Lys Glu Val Tyr Gly Phe Asn Pro Glu Gly Lys Ala Leu 
100 105 110 

Leu Lys Lys Thr Lys Asn Ser Glu Glu Phe Ala Ala Ala Met Ser Arg 
115 120 125 

Tyr Glu Leu Lys Leu Ala Ile Pro Glu Gly Lys Gln Val Phe Leu Tyr 
130 135 140 

5 Pro Gl» Lys Asp Glu Pro Thr Tyr lie Leu Asn He Lys Arg Gly He 

He Ser Ala Leu Leu Val Pro Pro Glu Thr Glu Glu Ala Lys Gin Val 
10 170 175 

Leu Phe Leu Asp Thr Val Tyr Gly Asn Cys Ser Thr His Phe Thr Val 

185 190 


Lys Thr Arg Lys Gly Asn Val Ala Thr Glu He Ser Thr Glu Arg Asp 



205 



20 Leu Gly Gin Cys Asp Arg Phe Lys Pro He Arg Thr Gly He Ser Pro 



220 



25 22*5 116 **» S£ Met «*■ **■ **° Leu Ser Thr Leu He Ser 

235 240 

^ Ser Ser Gin Ser Cys Gin Tyr Thr Leu Asp Ala Lys Arg Lys His Val 



Ala Glu Ala He Cys Lys Glu Gin His Leu Phe Leu Pro Phe Ser Tyr 

^o5 270 


Asn Asn Lys Tyr Gly Met Val Ala Gin Val Thr Gin Thr Leu Lys Leu 



285 



40 Glu Asp Thr Pro Lys He Asn Ser Arg Phe Phe Gly Glu Gly Thr Lys 



300 



45 3»1 M6t Gly LSU Ala f he Glu Ser «« «*■ Ser Thr Ser Pro Pro Lys 

315 320 

Gin Ala Glu Ala Val Leu Lys Thr Leu Gin Glu Leu Lys Lys Leu Thr 
50 33 ° 33S 

He Ser Glu Gin Asn He Gin Arg Ala Asn Leu Phe Asn Lys Leu Val 

345 350 


Thr Glu Leu Arg Gly Leu Ser Asp Glu Ala Val Thr Ser Leu Leu Pro 

360 365 

60 Gin Leu He Glu Val ser Ser Pro He Thr Leu Gin Ala Leu Val Gin 

380 

65 % ° ly Gln Pr ° Gln S « »r Hi. »• jeu Gin Trp Leu Lys Arg 

Val His Ala Asn Pro Leu Leu He Asp Val Val Thr Tyr Leu Val Ala 
70 410 415 

Leu He Pro Glu Pro Ser Ala Gln Gln Leu Arg Glu He Phe Asn Met 

425 430 


Ala Arg Asp Gln Arg Ser Arg Ala Thr Leu Tyr Ala Leu Ser His Ala 



445 



Val Asn Asn Tyr His Lys Thr Asn Pro Thr Gly Thr Gln Glu Leu Leu 
450 455 460 

Asp Ile Ala Asn Tyr Leu Met Glu Gln Ile Gln Asp Asp Cys Thr Gly 
465 470 475 480 

Asp Glu Asp Tyr Thr Tyr Leu Ile Leu Arg Val Ile Gly Asn Met Gly 
485 490 495 

Gln Thr Met Glu Gln Leu Thr Pro Glu Leu Lys Ser Ser Ile Leu Lys 
500 505 510 

Cys Val Gln Ser Thr Lys Pro Ser Leu Met Ile Gln Lys Ala Ala Ile 
515 520 525 

Gln Ala Leu Arg Lys Met Glu Pro Lys Asp Lys Asp Gln Glu Val Leu 
530 535 540 

Leu Gln Thr Phe Leu Asp Asp Ala Ser Pro Gly Asp Lys Arg Leu Ala 
545 550 555 560 

Ala Tyr Leu Met Leu Met Arg Ser Pro Ser Gln Ala Asp Ile Asn Lys 
565 570 575 

Ile Val Gln Ile Leu Pro Trp Glu Gln Asn Glu Gln Val Lys Asn Phe 
580 585 590 

Val Ala Ser His Ile Ala Asn Ile Leu Asn Ser Glu Glu Leu Asp Ile 
595 600 605 

Gln Asp Leu Lys Lys Leu Val Lys Glu Ala Leu Lys Glu Ser Gln Leu 
610 615 620 

Pro Thr Val Met Asp Phe Arg Lys Phe Ser Arg Asn Tyr Gln Leu Tyr 
625 630 635 640 

Lys Ser Val Ser Leu Pro Ser Leu Asp Pro Ala Ser Ala Lys Ile Glu 
645 650 655 

Gly Asn Leu Ile Phe Asp Pro Asn Asn Tyr Leu Pro Lys Glu Ser Met 
660 665 670 

Leu Lys Thr Thr Leu Thr Ala Phe Gly Phe Ala Ser Ala Asp Leu Ile 
675 680 685 

Glu Ile Gly Leu Glu Gly Lys Gly Phe Glu Pro Thr Leu Glu Ala Leu 
690 695 700 

Phe Gly Lys Gln Gly Phe Phe Pro Asp Ser Val Asn Lys Ala Leu Tyr 
705 710 715 720 

Trp Val Asn Gly Gln Val Pro Asp Gly Val Ser Lys Val Leu Val Asp 
725 730 735 

His Phe Gly Tyr Thr Lys Asp Asp Lys His Glu Gln Asp Met Val Asn 
740 745 750 




Gly lie Met Leu Ser Val Glu Lys Leu lie Lys Asp Leu Lys Ser Lys 
/3:> 760 765 

G1U yii Pr ° G1U Ala A Jf Leu ^9 Leu Gly Glu Glu Leu 
Gly Phe Ala Ser Leu His Asp Leu Gin Leu Leu Gly Lys Leu Leu Leu 

Met Gly Ala Arg Thr Leu Gin Gly lie Pro Gin Met lie Gly Glu Val 
805 810 815 

lie Arg Lys Gly Ser Lys Asn Asp Phe Phe Leu His Tyr lie Phe Met 
820 825 830 

Glu Asn Ala Phe Glu Leu Pro Thr Gly Ala Gly Leu Gin Leu Gin lie 
«■»= 840 845 

Ser ser Ser Gly Val He Ala Pro Gly Ala Lys Ala Gly Val Lys Leu 
w 855 

Glu Val Ala Asn Met Gin Ala Glu Leu Val Ala Lys Pro Ser Val Ser 

Val Glu Phe Val Thr Asn Met Gly He He He Pro Asp Phe Ala Arg 
885 890 -895 

Ser Gly Val Gin Met Asn Thr Asn Phe Phe His Glu Ser Gly Leu Glu 
900 905 g 10 

Ala His val Ala Leu Lys Ala Gly Lys Leu Lys Phe He He Pro Ser 

920 925 

Pro Lys Arg Pro Val Lys Leu Leu Ser Gly Gly Asn Thr Leu His Leu 
* JO 935 940 

Val ser Thr Thr Lys Thr Glu Val He Pro Pro Leu He Glu Asn Arg 

950 955 ggo 

Gin Ser Trp Ser Val Cys Lys Gin Val Phe Pro Gly Leu Asn Tyr Cys 
» 65 970 975 

Thr Ser Gly Ala Tyr Ser Asn Ala Ser Ser Thr Asp Ser Ala Ser Tyr 
980 985 990 

Tyr Pro Leu Thr Gly Asp Thr Arg Leu Glu Leu Glu Leu Arg Pro Thr 
9 " 1000 1005 

GlY lllO ° ln *** ?nf, Val Ser Ala Thr Glu Leu Gin ' 

l " xv 1015 1020 

*** G 025 ASP Ala LSU Y a L Asp Thr Leu ^e val Thr Gin 
u " 1030 1035 

^40 Gly ^ ° ln wis G1U Ala Thr Met JJ| Lys Tyr 
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Asn Arg Gin Ser Met Thr Leu Ser Ser Glu Val Gin He Pro Asp 
1055 1060 1065 

5 Phe Asp Val Asp Leu Gly Thr He Leu Arg Val Asn Asp Glu Ser 
1070 1075 1080 

Thr Glu Gly Lys Thr Ser Tyr Arg Leu Thr Leu Asp He Gin Asn 
10 1085 1090 1095 






Lys Lys He Thr Glu Val Ala Leu Met Gly His Leu Ser Cys Asp 
1100 1105 1110 

Thr Lys Glu Glu Arg Lys He Lys Gly Val He Ser He Pro Arg 
1115 1120 1125 

Leu Gin Ala Glu Ala Arg Ser Glu He Leu Ala His Trp Ser Pro 
1130 1135 1140 



25 Ala Lys Leu Leu Leu Gin Met Asp Ser Ser Ala Thr Ala Tyr Gly 
1145 1150 1155 

Ser Thr Val Ser Lys Arg Val Ala Trp His Tyr Asp Glu Glu Lys 
30 1160 1165 1170 



He Glu Phe Glu Trp Asn Thr Gly Thr Asn Val Asp Thr Lys Lys 
1175 1180 1185 

Met Thr Ser Asn Phe Pro Val Asp Leu Ser Asp Tyr Pro Lys Ser 
1190 1195 1200 

Leu His Met Tyr Ala Asn Arg Leu Leu Asp His Arg Val Pro Glu 
1205 1210 1215 



45 Thr Asp Met Thr Phe Arg His Val Gly Ser Lys Leu He Val Ala 
1220 1225 1230 

Met Ser Ser Trp Leu Gin Lys Ala Ser Gly Ser Leu Pro Tyr Thr 
50 1235 1240 1245 



Gin Thr Leu Gin Asp His Leu Asn Ser Leu Lys Glu Phe Asn Leu 
1250 . 1255 1260 

Gin . Asn Met Gly Leu Pro Asp Phe His He Pro Glu Asn Leu Phe 
1265 1-270 1275 

Leu Lys Ser Asp Gly Arg Val Lys Tyr Thr Leu Asn Lys Asn Ser 
1280 1285 1290 



65 Leu Lys He Glu He Pro Leu Pro Phe Gly Gly Lys Ser Ser Arg 
1295 1300 1305 

Asp Leu Lys Met Leu Glu Thr Val Arg Thr Pro Ala Leu His Phe 
70 1310 1315 1320 



Lys Ser Val Gly Phe His Leu Pro Ser Arg Glu Phe Gin Val Pro 
1325 1330 1335 

Thr Phe Thr Ile Pro Lys Leu Tyr Gln Leu Gln Val Pro Leu Leu 
1340 1345 1350 



5 itls LeU ASP Leu Ser ™£ n Asn Val 'Vr Ser Asn Leu Tyr Asn 

1360 1365 

^ f!?0 ^ SSr Gly Asn Ser *" Asp His Phe 

10 1375 1380 






Ser Leu s Arg Ala Arg Tyr HiS Q Met Lys Ala Asp Ser^ Val Val Asp 

Leu iloo ser *** Asn Val ?is 5 Gly Ser Gly Glu Thr «** *«P 

20 His Ly^ Asn Thr Phe Thr Le^ Ser Cys Asp Gly Ser, Leu Arg His 
25 ^ JSo lSU *• Ser Asn 3f 5 *>y* »» ser His Val Q Glu Lys Leu 
3Q 144*5 Val SSr ftfo Gly Leu Le « PJ| s Asp Ala Ser 






fllo ^ ^ ^ Gln fS s Ser Ala S ~ Val HiS Q Leu Asp Ser 
Lys Vy^ Lys Glu His Leu Ph^ Val Lys Glu Val LyS g He Asp Gly 
40 Gin Phe Q Arg Val Ser Ser Ph^ Tyr Ala Lys Gly Th^ Tyr Gly Leu 

45 Ser SS 5 Gln **" ASP Pr ° A I? 0 Thr Gly Arg teu £ J 5 Gly Glu Ser 

Asn Leu Arg Phe Asn Ser Ser Tyr Leu Gln Gly Thr Asn Gln He 
50 1525 1530 

^ M35 *** G1U ASP Thr Leu Ser Thr Ser Thr Ser 

1540 1545 




^ ffi 0 ^ GlY 116 f|f 5 Lys Asn Thr Al« f|| 0 Leu Lys Tyr 



60 Glu Asn_ Tyr Glu Leu Thr LeU Q Lys Ser Asp Thr Asn Gly Lys Tyr 



1575 

65- Lys ilao Phe Ala Ser A |S 5 Lys Met Asp Met * h f 0 phe s « *>y* 

Gln Asn Ala Leu Leu Arg Ser Glu Tyr Gln Ala Asp Tyr Glu Ser 
70 1600 1605 

Leu Arg Phe Phe Ser Leu Lev. Ser Gly Ser Leu Asn ser His Gly 
x " 1615 1620 


Leu Glu Leu Asn Ala Asp lie Leu Gly Thr Asp Lys He Asn Ser 
°" 1630 1535 






Gly Ala His Lys Ala Thr Leu Arg lie Gly Gin Asp Gly lie Ser 
1640 1645 1650 

Thr Ser Ala Thr Thr Asn Leu Lys Cys Ser Leu Leu Val Leu Glu 
1655 1660 1665 

Asn Glu Leu Asn Ala Glu Leu Gly Leu Ser Gly Ala Ser Met Lys 
1670 1675 1680 



15 Leu Thr Thr Asn Gly Arg Phe Arg Glu His Asn Ala Lys Phe Ser 
1685 1690 1695 

Leu Asp Gly Lys Ala Ala Leu Thr Glu Leu Ser Leu Gly Ser Ala 
20 1700 1705 1710 



Tyr Gin Ala Met lie Leu Gly Val Asp Ser Lys Asn He Phe Asn 
1715 1720 1725 

Phe Lys Val Ser Gin Glu Gly Leu Lys Leu Ser Asn Asp Met Met 
1730 1735 1740 

Gly Ser Tyr Ala Glu Met Lys Phe Asp His Thr Asn Ser Leu Asn 
1745 1750 1755 



35 He Ala Gly Leu Ser Leu Asp Phe Ser Ser Lys Leu Asp Asn lie 
1760 1765 1770 

Tyr Ser Ser Asp Lys Phe Tyr Lys Gin Thr Val Asn Leu Gin Leu 
40 1775 1780 1785 



Gin Pro Tyr Ser Leu Val Thr Thr Leu Asn Ser Asp Leu Lys Tyr 
1790 1795 1800 

Asn Ala Leu Asp Leu Thr Asn Asn Gly Lys Leu Arg Leu Glu Pro 
1805 1810 1815 

Leu Lys Leu His Val Ala Gly Asn Leu Lys Gly Ala Tyr Gin Asn 
1820 1825 1830 



55 Asn Glu He Lys His He Tyr Ala He Ser Ser Ala Ala Leu Ser 
1835 1840 1845 

Ala Ser Tyr Lys Ala Asp Thr Val Ala Lys Val Gin Gly Val Glu 
60 1850 1855 1860 



Phe Ser His Arg Leu Asn Thr Asp He Ala Gly Leu -Ala Ser Ala 
1865 1870 1875 

He Asp Met Ser Thr Asn Tyr Asn Ser Asp Ser Leu His Phe Ser 
1880 1885 1890 

Asn Val Phe Arg Ser Val Met Ala Pro Phe Thr Met Thr He Asp 
1895 1900 1905 



75 Ala His Thr Asn Gly Asn Gly Lys Leu Ala Leu Trp Gly Glu His 
1910 1915 1920 
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Thr Gln LeU Ser « Phe Leu Leu L ^ s Ala Glu 

1^25 1930 1935 

Ala ?£!f« Thr phe Ser His As P ^ Lys Gly Ser Thr Ser His His 
1940 1945 1950 

Leu Val Ser Arg Lys Ser He Ser Ala Ala Leu Glu His Lys Val 
J-955 i960 1965 

Ser ?o*n Leu Leu Thr Pro Ala Glu Gln Thr Trp Lys Leu 

1970 1975 1980 

LyS Gln Phe Asn Asn Asn Glu ^ Ser Gln Asp Leu Asp Ala 

1985 1990 1995 

9nnn Thr LyS ASP LyS Ile GIy Glu Leu Thr Gly Arg Thr 

2000 2005 2010 

LeU £n a « ASP Leu ^ Leu Leu As P Ser Pro Lys Val Pro Leu 
2015 2020 2025 

L6U SSr G1U Pr ° Ile A S* Ile Ile As P Ala Leu Glu Met Arg 

2030 2035 2040 

ASP m« ° 1U LyS Pr ° ?i n Glu Phe Thr lle Val A ** Phe Val 

2045 2050 2055 

LyS ££n ASP LyS Asn Gln Asp Val His Ser Ile A *n Leu Pro Phe 
2050 2065 2070 

PHe ™Sr Thr LeU Gln Glu phe Glu A ^n Arg Gln Thr Ile 

2075 2080 2085 

116 2090 G1U ASn Y*i Gln *** Asn Leu H « He Asn 

2090 2095 2100 

Ile Asp Gln Phe Val Arg Lys Tyr Arg Ala Ala Leu Gly Lys Leu 
2105 2110 2115 

Gln Ala Asn Asp Leu Asn Ser Asn Trp Glu Arg 

2120 2125 2130 

Gln Ser His Ala Lys Glu ' Lys Leu Thr Ala Leu Thr Lys Lys 

2135 2140 2145 

n Thr ° 1U ASn ASP Ile Gln Ile Ala Leu Asp Asp Ala 

2150 2155 2160 

LyS c Asn Phe Asn Glu Lys Leu Ser Gln Leu Gln Thr Tyr Met 
2165 2170 2175 

f 1R0 ^ ASP ™ LyS ASP Ser ^ r AS P L eu His ASP 

2180 2185 2190 

LeU SffS* 116 Ala " e Ala ^ Ile Ile As P Glu Ile He Glu Lys 
Z - L « 2200 2205 
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Leu kys Ser Leu Asp Glu His Tyr His He Arg Val Asn Leu Val 
2210 2215 2220 

5 Lys Thr He His Asp Leu His Leu Phe lie Glu Asn He Asp Phe 
2225 2230 2235 

Asn Lys Ser Gly Ser Ser Thr Ala Ser Trp He Gin Asn Val Asp 
10 2240 2245 2250 






Thr Lys Tyr Gin He Arg He Gin He Gin Glu Lys Leu Gin Gin 
2255 2260 2265 

Leu Lys Arg His He Gin Asn He Asp He Gin His Leu Ala Gly 
2270 2275 2280 

Lys Leu Lys Gin His He Glu Ala He Asp Val Arg Val Leu Leu 
2285 2290 2295 



25 Asp Gin Leu Gly Thr Thr He Ser Phe Glu Arg He Asn Asp Val 
2300 2305 2310 

Leu Glu His Val Lys His Phe Val He Asn Leu He ' Gly Asp Phe 
30 2315 2320 2325 



Glu Val Ala Glu Lys He Asn Ala Phe Arg Ala Lys Val His Glu 
2330 2335 2340 

Leu He Glu Arg Tyr Glu Val Asp Gin Gin He Gin Val Leu Met 
2345 2350 2355 

Asp Lys Leu Val Glu Leu Thr His Gin Tyr Lys Leu Lys Glu Thr 
2360 2365 2370 



45 He Gin Lys Leu Ser Asn Val Leu Gin Gin Val Lys He Lys Asp 
2375 2380 2385 

^ Tyr Phe Glu Lys Leu Val Gly Phe He Asp Asp Ala Val Lys Lys 
50 2390 2395 2400 



Leu Asn Glu Leu Ser Phe Lys Thr Phe He Glu Asp Val Asn Lys 
2405 2410 2415 

Phe Leu Asp Met Leu He Lys Lys Leu Lys Ser Phe Asp Tyr His 
2420 2425 2430 

Gin Phe Val Asp Glu Thr Asn Asp Lys He Arg Glu Val Thr Gin 
2435 2440 2445 



65 Arg Leu Asn Gly Glu He Gin Ala Leu Glu Leu Pro Gin Lys Ala 
2450 2455 2460 

Glu Ala Leu Lys Leu Phe Leu Glu Glu Thr Lys Ala Thr Val Ala 
70 2465 2470 2475 



Val Tyr Leu Glu Ser Leu Gin Asp Thr Lys He Thr Leu He He 
2480 2485 2490 

Asn Trp Leu Gin Glu Ala Leu Ser Ser Ala Ser Leu Ala His Met 
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2495 2500 2505 

Lys Ala Lys Phe Arg Glu Thr Leu Glu Asp Thr Arg Asp Arg Met 
2510 2515 2520 

Tyr Gin Met Asp He Gin Gin Glu Leu Gin Arg Tyr Leu Ser Leu 
2525 2530 2535 

Val Gly Gin Val Tyr Ser Thr Leu Val Thr Tyr He Ser Asp Trp 
2540 2545 2550 

Trp Thr Leu Ala Ala Lys Asn Leu Thr Asp Phe Ala Glu Gin Tyr 
2555 2560 2565 

Ser lie Gin Asp Trp Ala Lys Arg Met Lys Ala Leu Val Glu Gin 
2570 2575 2580 

Gly SML Thr Val pro Glu Ile L V S Thr Ile Leu Gly Thr Met Pro 
2585 2590 2595 

Ala ~® Glu Val Ser Leu Gln Ala Leu Gin Lys Ala Thr Phe Gin 
2600 2605 2610 

Thr Asp Phe Xle Val pro Leu Thr Asp Leu Arg Ile Pro Ser 

2615 2620 2625 

Val 5i n A Ile Asn phe Lys Asp Leu Asn lie Lys Ile Pro Ser 

2630 2635 2640 

Arg Phe Ser Thr Pro Glu Phe Thr Ile Leu Asn Thr Phe His lie 
2645 2650 2655 

Pro o e 5^ Phe * tor Ile Asp Phe Val Glu Met Lys Val Lys Ile lie 
2660 2665 2670 

Arg Thr lie Asp Gin Met Gin Asn Ser Glu Leu Gin Trp Pro Val 
2675 2680 2685 

Pro Asp He Tyr Leu Arg Asp Leu Lys Val Glu Asp lie Pro Leu 
2690 2695 2700 

Ala Arg lie Thr Leu Pro Asp Phe Arg Leu Pro Glu lie Ala Ile 
2705 2710 2715 

Pro Glu Phe Ile Ile Pro Thr Leu Asn Leu Asn Asp Phe Gin Val 
2720 2725 2730 

Pro Asp Leu His Ile Pro Glu Phe Gin Leu Pro His lie Ser His 
2735 2740 2745 

Thr lie Glu Val Pro Thr Phe Gly Lys Leu Tyr Ser He Leu Lys 
2750 2755 2760 

lle Si— Ser Pro Leu Phe Thr Leu Asp Ala Asn Ala Asp He Gly 
2765 2770 2775 

Asn Thr Thr Ser Ala Asn Glu Ala Gly He Ala Ala Ser Ile 

2780 2785 2790 
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Thr Ala Lys Gly Glu Ser Lys Leu Glu Val Leu Asn Phe Asp Phe 
2795 2800 ^HU3 


Gin Ala Asn Ala Gin Leu Ser Asn Pro Lys He Asn Pro Leu Ala 
2810 2815 2820 

10 Leu Lys Glu Ser Val Lys Phe Ser Ser Lys Tyr Leu Arg Thr Glu 
2825 2830 2835 

15 His Gly Ser Glu Met Leu Phe Phe Gly Asn Ala lie Glu Gly Lys 
2840 2845 2850 

ser Asn Thr Val Ala Ser Leu His Thr Glu Lys Asn Thr Leu Glu 
20 2855 2860 2865 

Leu Ser Asn Gly Val He Val Lys He Asn Asn Gin Leu Thr Leu 
2870 2875 2880 


Asp Ser Asn Thr Lys Tyr Phe His Lys Leu Asn IjLe Pro Lys Leu 
2.88S 2890 28 »5 

30 Asp Phe Ser Ser Gin Ala Asp Leu Arg Asn Glu lie Lys Thr Leu 
2900 2905 ^ ±yj 

35 Leu Lys s Ala Gly His He Al^ Trp Thr Ser Ser ol^ Lys Gly Ser 

Trp Lys Trp Ala Cys Pro Arg Phe Ser Asp Glu Gly Thr His Glu 
40 2930 2935 294° 

Ser Gin He Ser Phe Thr He Glu Gly Pro Leu Thr Ser Phe Gly 
2945 2950 2955 


Leu Ser Asn Lys He Asn Ser Lys His Leu Arg Val Asn Gin Asn 
2960 2965 2 

50 Leu Val Tyr Glu Ser Gly Ser Leu Asn Phe Ser Lys Leu Glu He 
2975 2980 2985 

55 Gin Ser Gin Val Asp Ser Gin His Val Gly His Ser Val Leu Thr 
2990 2995 

Ala Lys Gly Met Ala Leu Phe Gly Glu Gly Lys Ala Glu Phe Thr 
60 3005 301° 3015 

Gly Arg His Asp Ala His Leu Asn Gly Lys Val lie Gly Thr Leu 
3020 3025 


Lys Asn Ser Leu Phe Phe Ser Ala Gin Pro Phe Glu He Thr Ala 
3035 3040 J"* 3 

70 Ser Thr Asn Asn Glu Gly Asn Leu Lys Val Arg Phe Pro Leu Arg 
3050 3055 JUDU 

75 Leu Thr Gly Lys He Asp Phe Leu Asn Asn Tyr Ala Leu Phe Leu 
3065 3070 
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Ser ? r °« Ser Ala Gln Gln Ala Ser Tr P 61n Val Ser Ala Arg Phe 
3080 3085 3090 

ASn Lys Asn Gln Asn Phe Ser Ala Gly Asn Asn Glu 

3095 3100 3105 

Asn lie Met Glu Ala His Val Gly He Asn Gly Glu Ala Asn Leu 
3110 3115 3120 

ASP LeU Asn Ile Pro Leu Thr Ile Pro G1 u Met Arg Leu Pro 

31^5 3130 3 135 

Ile I1S Thr Thr Pro Pro Leu Lys Aso Phe Ser Leu Trp 
3140 3145 3150 

Glu Lys Thr Gly Leu Lys Glu Phe Leu Lys Thr Thr Lys Gln Ser 
3155 3160 3i6 5 

Phe Asp Leu Ser Val Lys Ala Gln Tyr Lys Lys Asn Lys His Arg 
3170 3175 3180 

HiS ??L Ile Thr Asn Pro Leu Ala Val Leu c y s <31u Phe lie Ser 
3l»5 3190 3195 

Gln ??5o hYS Ser PhS ^5 His phe Glu L * s Asn Arg Asn 

3200 3205 3210 

Asn Ala Leu Asp Phe Val Thr Lys Ser Tyr Asn Glu Thr Lys lie 
3215 3220 3225 

LyS ?o?n ASP LyS LyS Ala Glu Ser His Asp Glu Leu Pro 

3230 3235 3240 

Arg Thr Phe Gln Ile Pro Gly Tyr Thr Val. Pro Val Val Asn Val 
3245 3250 3255 

G1U Y?L Ser Pro Phe Thr Ile Glu Met Ser Ala Phe Gly Tyr Val 
3260 3265 3270 

PhS ? r ? c LyS Ala Val Ser Met Pro Ser Phe Ser Ile Leu Gly Ser 
3275 3280 3285 

ASP YoL Val Pro Ser Thr Leu Ile Pro Ser Leu Glu 

3290 3295 3300 

LeU o?S e Val Leu His Val Pro Asn Leu Lys Leu Ser Leu Pro 

3305 3310 3315 

HiS LyS Glu Leu Thr Ile Ser His Ile p ^e Ile Pro Ala 

3320 3325 3330 

Met Asn Ile Thr ^ Asp Phe Ser phe L ys Ser Ser Val He 

3335 % 3340 3345 

Thr ^?cn Asn Tnr Asn Ala Glu Leu Phe Asn Gln Ser Asp lie Val 
3350 3355 3350 
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Ala His Leu Leu Ser Ser Ser Ser Ser Y^, Ile As P Ala LeU Gln 
3365 3370 33 

5 Tyr Lys Leu Glu Gly Thr Thr Arg Leu Thr Arg Lys Arg Gly Leu 
3380 3385 

Lys Leu Ala Thr Ala Leu Ser Leu Ser Asn Lys Phe Val Glu Gly 
10 3395 3400 3405 

Ser His Asn Ser Thr Val Ser Leu Thr Thr Lys Asn Met Glu Val 
3410 3415 J«zu 


Ser Val Ala Lys Thr Thr Lys Ala Glu He Pro lie Leu Arg Met 
3425 3430 3435 






Asn Phe Lys Gin Glu Leu Asn Gly Asn Thr Lys Ser Lys Pro Thr 
3440 3445 J,w 



25 Val Ser Ser Ser Met Glu Phe Lys Tyr Asp Phe Asn Ser Ser Met 
3455 3460 3465 

Leu Tyr Ser Thr Ala Lys Gly Ala Val Asp His Lys Leu Ser Leu 
30 3470 3475 3480 

Glu Ser Leu Thr Ser Tyr Phe Ser He Glu Ser Ser Thr Lys Gly 
3485 3490 3495 


Asp Val Lys Gly Ser Val Leu Ser Arg Glu Tyr Ser Gly Thr He 
3500 3505 3510 

40 Ala Ser Glu Ala Asn Thr Tyr Leu Asn Ser Lys Ser Thr Arg Ser 
3515 3520 3525 

45 Ser val Lys Leu Gin Gly Thr Ser Lys He Asp Asp He Trp Asn 
3530 3535 35«u 

Leu Glu Val Lys Glu Asn Phe Ala Gly Glu Ala Thr Leu Gin Arg 
50 3545 3550 3555 

He Tyr ser Leu Trp Glu His Ser Thr Lys Asn His Leu Gin Leu 

3560 3565 " /u 


Glu Gly Leu Phe Phe Thr Asr^ Gly Glu His Thr Ser. Lys Ala Thr 
3 575 

60 Leu Glu Leu Ser Pro Trp Gin Met Ser Ala Leu Val Gin Val His 

3590 3595 3600 

65 Ala ser Gin Pro Ser Ser Phe His Asp Phe Pro Asp Leu Gly Gin 
3605 3610 3DJ.5 

Glu Val Ala Leu Asn Ala Asn Thr Lys Asn Gin Lys He Arg Trp 
70 3620 3625 3630 



Lys Asn Glu val Arg He His Ser Gly Ser Phe Gin Ser Gin Val 
3635 3640 3645 


Glu Leu Ser Asn Asp Gin Glu Lys Ala His Leu Asp He Ala Gly 
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3650 3655 3660 

Ser Leu Glu Gly His Leu Arg Phe Leu Lys Asn lie He Leu Pro 
3665 3670 3675 

Val Tyr Asp Lys Ser Leu Trp Asp Phe Leu Lys Leu Asp Val Thr 
3680 3685 3690 

Thr Ser lie Gly Arg Arg Gin His Leu Arg Val Ser Thr Ala Phe 
3695 3700 3705 

Val Tyr Thr Lys Asn Pro Asn Gly Tyr Ser Phe Ser He Pro Val 
3710 3715 3720 

Lys Val Leu Ala Asp Lys Phe He Thr Pro Gly Leu Lys Leu Asn 
3725 3730 3735 

Asp Leu Asn Ser Val Leu Val Met Pro Thr Phe His Val Pro Phe 
3740 3745 3750 

Thr Asp Leu Gin Val Pro Ser Cys Lys Leu Asp Phe Arg Glu lie 
3755 3760 3765 

Gin He Tyr Lys Lys Leu Arg Thr Ser Ser Phe Ala Leu Asn Leu 
3770 3775 3780 

Pro Thr Leu Pro Glu Val Lys Phe Pro Glu Val Asp Val Leu Thr 
3785 3790 3795 

Lys Tyr Ser Gin Pro Glu Asp Ser Leu He Pro Phe Phe Glu He 
3800 3805 3810 

Thr Val Pro Glu Ser Gin Leu Thr Val Ser Gin Phe Thr Leu Pro 
3815 3820 3825 

Lys Ser Val Ser Asp Gly He Ala Ala Leu Asp Leu Asn Ala Val 
3830 3835 3840 

Ala Asn Lys He Ala Asp Phe Glu Leu Pro Thr He He Val Pro 
3845 3850 3855 

Glu Gin Thr He Glu He Pro Ser He Lys Phe Ser Val Pro Ala 
3860 3865 3870 

Gly He Val He Pro Ser Phe Gin Ala Leu Thr Ala Arg Phe Glu 
3875 3880 3885 

Val Asp Ser Pro Val Tyr Asn Ala Thr Trp Ser Ala Ser Leu Lys 
3890 3895 3900 

Asn Lys Ala Asp Tyr Val Glu Thr Val Leu Asp Ser Thr Cys Ser 
3905 3910 3915 

Ser Thr Val Gin Phe Leu Glu Tyr Glu Leu Asn Val Leu Gly Thr 
3920 3925 3930 



His Lys He Glu Asp Gly Thr Leu Ala Ser Lys Thr Lys Gly Thr 
3935 3940 3945 
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Leu Ala His Arg Asp Phe Ser Ala Glu Tyr Glu Glu Asp Gly Lys 
3950 3955 3960 


Phe Glu Gly Leu Gin Glu Trp Glu Gly Lys Ala His Leu Asn lie 
3965 3970 3975 

10 Lys Ser Pro Ala Phe Thr Asp Leu His Leu Arg Tyr Gin Lys Asp 
y 3980 3985 3990 

15 Lys Lys Gly He Ser Thr Ser Ala Ala Ser Pro Ala Val Gly Thr 
3995 4000 4005 

Val Gly Met Asp Met Asp Glu Asp Asp Asp Phe Ser Lys Trp Asn 
20 4010 4015 4020 

Phe Tyr Tyr Ser Pro Gin Ser Ser Pro Asp Lys Lys Leu Thr He 
4025 4030 4035 


Phe Lys Thr Glu Leu Arg Val Arg Glu Ser Asp Glu Glu Thr Gin 
4040 4045 405O 

30 lie Lys Val Asn Trp Glu Glu Glu Ala Ala Ser Gly Leu Leu Thr 
4055 4060 4065 

35 Ser Leu Lys Asp Asn Val Pro Lys Ala Thr Gly Val Leu Tyr Asp 
4070 4075 4080 

Tyr Val Asn Lys Tyr His Trp Glu His Thr Gly Leu Thr Leu Arg 
40 4085 " 4090 4095 

Glu Val Ser Ser Lys Leu Arg Arg Asn Leu Gin Asn Asn Ala Glu 
4100 4105 4110 


Trp Val Tyr Gin Gly Ala He Arg Gin He Asp Asp He Asp Val 
4115 4120 4X25 

50 Arg Phe Gin Lys Ala Ala Ser Gly Thr Thr Gly Thr Tyr Gin Glu 
4130 4135 

55 Trp Lys Asp Lys Ala Gin Asn Leu Tyr Gin Glu Leu Leu Thr Gin 
4145 4150 

Glu Gly Gin Ala Ser Phe Gin Gly Leu Lys Asp Asn Val Phe Asp 
60 4160 4165 4170 

Gly Leu Val Arg Val Thr Gin Lys Phe His Met Lys Val Lys His 
4175 4180 • 4185 


Leu lie Asp Ser Leu He Asp Phe Leu Asn Phe Pro Arg Phe Gin 
4190 4195 4200 

70 Phe Pro Gly Lys Pro Gly He Tyr Thr Arg Glu Glu Leu Cys Thr 
4205 4210 4 215 

75 Met Phe He Arg Glu Val Gly Thr Val Leu Ser Gin Val Tyr Ser 
4220 4225 4230 




LyS Y^L His Asn Gly Ser Glu IIe Leu *he Ser Tyr Phe Gin Asp 
4235 4240 4245 

LeU Y?L Ile Thr Leu Pro phe Glu Leu **9 L ys His Lys Leu lie 
4250 4255 4260 

^ Y?L Ile Ser Ket Glu Leu Leu L ys Asp Leu Ser Lys 

4265 4270 4275 

Glu ^t* Gln Glu Val phe Lys Ala Ile Gln Ser Leu Lys Thr Thr 
4280 4285 4290 

Glu YniL Leu ^ Asn Leu Gln As P Leu Leu Gln Phe Ile Phe Gln 
4295 4300 4305 

L6U 61u Asp Asn Ile hys Gln Leu L y s Glu Met Lys Phe Thr 

4310 4315 4320 

^= Ile Asn Ile Gln Glu Ile As * Thr lie Phe Asn 

4325 4330 4335 

* SP Ile Pro Val Phe Lys Leu Leu L VS Glu Asn Leu Cys 

4340 4345 4350 

Leu Asn Leu His Lys Phe Asn Glu Phe lie Gln Asn Glu Leu Glo 
4355 4360 4365 

G1U Ser Gln Glu * Leu Gln Gln xle His Gln Tyr lie Met Ala 

4370 4375 438O 

LSU e Glu Glu TyT Phe As *> pro Ser Ile Va * Gly Trp Thr Val 
4385 4390 4395 

LyS T^nn ^ Glu Leu Glu Glu Ile Val Ser *^u He Lys Asn 

4400 4405 4410 

LeU Val Ala Leu hys As P phe His Ser Glu Tyr Ile Val Ser 

44 15 4420 4425 

Ala ? e ?n Asn Phe Thr Ser Gln Leu Ser Ser Gln v al Glu Gln Phe 
4430 4435 4440 

L ® U Arg Asn Ile Gln Glu ^ Leu Ser Ile Leu Thr Asp Pro 

444 5 4450 4455 

Asp Lys Gly Lys Glu L y s Ile Ala Glu Leu Ser Ala Thr Ala 

4460 4465 4470 

Gln Ile Ile Lys Ser Gln Ala Ile Ala Thr Lys Lys Ile lie 

4475 4480 4485 

4490 4495 *** ^ 45S S ® r 

ASP Leu Ser Asp ^ Glu L y s Phe Ala Glu Ser Lys 

4 505 4510 4515 
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Arg Leu He Asp Leu Ser He Gin Asn ^« is Thr Phe LeU 
S 4520 4525 4530 

5 Tyr He Thr Glu Leu Leu Lys Lys Leu Gin Ser Thr Thr Val Met 
4535 4540 4545 

Asn Pro Tyr Met Lys Leu Ala Pro Gly Glu Leu Thr He He Leu 
10 4550 4555 4S&U 

<210> 42 

<211> 930 

<212> PRT 

<213> Homo sapiens 

<300> 

<308> Swiss-Prot/Q14624 

<309> 1998-07-15 

<313> (1)..(930) 

<400> 42 

Met Lys Pro Pro Arg Pro Val Arg Thr Cys Ser Lys Val Leu Val Leu 
1 5 10 15 

Leu Ser Leu Leu Ala Ile His Gln Thr Thr Thr Ala Glu Lys Asn Gly 
20 25 30 

Ile Asp Ile Tyr Ser Leu Thr Val Asp Ser Arg Val Ser Ser Arg Phe 
35 40 45 

Ala His Thr Val Val Thr Ser Arg Val Val Asn Arg Ala Asn Thr Val 
50 55 60 

Gln Glu Ala Thr Phe Gln Met Glu Leu Pro Lys Lys Ala Phe Ile Thr 
65 70 75 80 

Asn Phe Ser Met Asn Ile Asp Gly Met Thr Tyr Pro Gly Ile Ile Lys 
85 90 95 

Glu Lys Ala Glu Ala Gln Ala Gln Tyr Ser Ala Ala Val Ala Lys Gly 
100 105 110 

Lys Ser Ala Gly Leu Val Lys Ala Thr Gly Arg Asn Met Glu Gln Phe 
115 120 125 

Gln Val Ser Val Ser Val Ala Pro Asn Ala Lys Ile Thr Phe Glu Leu 
130 135 140 

Val Tyr Glu Glu Leu Leu Lys Arg Arg Leu Gly Val Tyr Glu Leu Leu 
145 150 155 160 

Leu Lys Val Arg Pro Gln Gln Leu Val Lys His Leu Gln Met Asp Ile 
165 170 175 

His Ile Phe Glu Pro Gln Gly Ile Ser Phe Leu Glu Thr Glu Ser Thr 
180 185 190 

Phe Met Thr Asn Gln Leu Val Asp Ala Leu Thr Thr Trp Gln Asn Lys 
195 200 205 

Thr Lys Ala His Ile Arg Phe Lys Pro Thr Leu Ser Gln Gln Gln Lys 
210 215 220 

Ser Pro Glu Gin Gin Glu Thr Val Leu Asp Gly Asn Leu lie lie Arg 
225 23 ° 235 240 

Tyr Asp Val Asp Arg Ala Ile Ser Gly Gly Ser Ile Gln Ile Glu Asn 
245 250 255 

Gly Tyr Phe Val His Tyr Phe Ala Pro Glu Gly Leu Thr Thr Met Pro 
260 265 270 

Lys Asn Val Val Phe Val Ile Asp Lys Ser Gly Ser Met Ser Gly Arg 
275 280 285 

Lys Ile Gln Gln Thr Arg Glu Ala Leu Ile Lys Ile Leu Asp Asp Leu 
290 295 300 

Ser Pro Arg Asp Gln Phe Asn Leu Ile Val Phe Ser Thr Glu Ala Thr 
305 310 315 320 

Gln Trp Arg Pro Ser Leu Val Pro Ala Ser Ala Glu Asn Val Asn Lys 
325 330 335 

Ala Arg Ser Phe Ala Ala Gly Ile Gln Ala Leu Gly Gly Thr Asn Ile 
340 345 350 

Asn Asp Ala Met Leu Met Ala Val Gln Leu Leu Asp Ser Ser Asn Gln 
355 360 365 

Glu Glu Arg Leu Pro Glu Gly Ser Val Ser Leu Ile Ile Leu Leu Thr 
370 375 380 

Asp Gly Asp Pro Thr Val Gly Glu Thr Asn Pro Arg Ser Ile Gln Asn 
385 390 395 400 

Asn Val Arg Glu Ala Val Ser Gly Arg Tyr Ser Leu Phe Cys Leu Gly 
405 410 415 

Phe Gly Phe Asp Val Ser Tyr Ala Phe Leu Glu Lys Leu Ala Leu Asp 
420 425 430 

Asn Gly Gly Leu Ala Arg Arg Ile His Glu Asp Ser Asp Ser Ala Leu 
435 440 445 

Gln Leu Gln Asp Phe Tyr Gln Glu Val Ala Asn Pro Leu Leu Thr Ala 
450 455 460 

Val Thr Phe Glu Tyr Pro Ser Asn Ala Val Glu Glu Val Thr Gln Asn 
465 470 475 480 

Asn Phe Arg Leu Leu Phe Lys Gly Ser Glu Met Val Val Ala Gly Lys 
485 490 495 

Leu Gln Asp Arg Gly Pro Asp Val Leu Thr Ala Thr Val Ser Gly Lys 
500 505 510 



Leu Pro Thr Gln Asn Ile Thr Phe Gln Thr Glu Ser Ser Val Ala Glu 
515 520 525 

Gln Glu Ala Glu Phe Gln Ser Pro Lys Tyr Ile Phe His Asn Phe Met 
530 535 540 

Glu Arg Leu Trp Ala Tyr Leu Thr Ile Gln Gln Leu Leu Glu Gln Thr 
545 550 555 560 

Val Ser Ala Ser Asp Ala Asp Gln Gln Ala Leu Arg Asn Gln Ala Leu 
565 570 575 

Asn Leu Ser Leu Ala Tyr Ser Phe Val Thr Pro Leu Thr Ser Met Val 
580 585 590 

Val Thr Lys Pro Asp Asp Gln Glu Gln Ser Gln Val Ala Glu Lys Pro 
595 600 605 

Met Glu Gly Glu Ser Arg Asn Arg Asn Val His Ser Gly Ser Thr Phe 
610 615 620 

Phe Lys Tyr Tyr Leu Gln Gly Ala Lys Ile Pro Lys Pro Glu Ala Ser 
625 630 635 640 

Phe Ser Pro Arg Arg Gly Trp Asn Arg Gln Ala Gly Ala Ala Gly Ser 
645 650 655 

Arg Met Asn Phe Arg Pro Gly Val Leu Ser Ser Arg Gln Leu Gly Leu 
660 665 670 

Pro Gly Pro Pro Asp Val Pro Asp His Ala Ala Tyr His Pro Phe Arg 
675 680 685 

Arg Leu Ala Ile Leu Pro Ala Ser Ala Pro Pro Ala Thr Ser Asn Pro 
690 695 700 

Asp Pro Ala Val Ser Arg Val Met Asn Met Lys Ile Glu Glu Thr Thr 
705 710 715 720 

Met Thr Thr Gln Thr Pro Ala Pro Ile Gln Ala Pro Ser Ala Ile Leu 
725 730 735 

Pro Leu Pro Gly Gln Ser Val Glu Arg Leu Cys Val Asp Pro Arg His 
740 745 750 

Arg Gln Gly Pro Val Asn Leu Leu Ser Asp Pro Glu Gln Gly Val Glu 
755 760 765 

Val Thr Gly Gln Tyr Glu Arg Glu Lys Ala Gly Phe Ser Trp Ile Glu 
770 775 780 

Val Thr Phe Lys Asn Pro Leu Val Trp Val His Ala Ser Pro Glu His 
785 790 795 800 

Val Val Val Thr Arg Asn Arg Arg Ser Ser Ala Tyr Lys Trp Lys Glu 
805 810 815 

Thr Leu Phe Ser Val Met Pro Gly Leu Lys Met Thr Met Asp Lys Thr 
820 825 830 

Gly Leu Leu Leu Leu Ser Asp Pro Asp Lys Val Thr Ile Gly Leu Leu 
835 840 845 

Phe Trp Asp Gly Arg Gly Glu Gly Leu Arg Leu Leu Leu Arg Asp Thr 
850 855 860 

Asp Arg Phe Ser Ser His Val Gly Gly Thr Leu Gly Gln Phe Tyr Gln 
865 870 875 880 

Glu Val Leu Trp Gly Ser Pro Ala Ala Ser Asp Asp Gly Arg Arg Thr 
885 890 895 

Leu Arg Val Gln Gly Asn Asp His Ser Ala Thr Arg Glu Arg Arg Leu 
900 905 910 

Asp Tyr Gln Glu Gly Pro Pro Gly Val Glu Ile Ser Cys Trp Ser Val 
915 920 925 

Glu Leu 
930 

<210> 43 

<211> 1744 

<212> 

<213> 

<300> 

<308> 

<309> 

<313> (1) 

<400> 43 

Met Arg Leu 
1 5 10 15 

Ser Leu Gln Lys Pro Arg Leu Leu Leu Phe Ser Pro Ser Val Val His 
20 25 30 

Leu Gly Val Pro Leu Ser Val Gly Val Gln Leu Gln Asp Val Pro Arg 
35 40 45 

Gly Gln Val Val Lys Gly Ser Val Phe Leu Arg Asn Pro Ser Arg Asn 
50 55 60 

Asn Val Pro Cys Ser Pro Lys Val Asp Phe Thr Leu Ser Ser Glu Arg 
65 70 75 80 

Asp Phe Ala Leu Leu Ser Leu Gln Val Pro Leu Lys Asp Ala Lys Ser 
85 90 95 

Cys Gly Leu His Gln Leu Leu Arg Gly Pro Glu Val Gln Leu Val Ala 
100 105 110 

His Ser Pro Trp Leu Lys Asp Ser Leu Ser Arg Thr Thr Asn Ile Gln 
115 120 125 

Gly Ile Asn Leu Leu Phe Ser Ser Arg Arg Gly His Leu Phe Leu Gln 
130 135 140 

Thr Asp Gln Pro Ile Tyr Asn Pro Gly Gln Arg Val Arg Tyr Arg Val 
145 150 155 160 

Phe Ala Leu Asp Gln Lys Met Arg Pro Ser Thr Asp Thr Ile Thr Val 
165 170 175 

Met Val Glu Asn Ser His Gly Leu Arg Val Arg Lys Lys Glu Val Tyr 
180 185 190 

Met Pro Ser Ser Ile Phe Gln Asp Asp Phe Val Ile Pro Asp Ile Ser 
195 200 205 

Glu Pro Gly Thr Trp Lys Ile Ser Ala Arg Phe Ser Asp Gly Leu Glu 
210 215 220 

Ser Asn Ser Ser Thr Gln Phe Glu Val Lys Lys Tyr Val Leu Pro Asn 
225 230 235 240 

Phe Glu Val Lys Ile Thr Pro Gly Lys Pro Tyr Ile Leu Thr Val Pro 
245 250 255 

Gly His Leu Asp Glu Met Gln Leu Asp Ile Gln Ala Arg Tyr Ile Tyr 
260 265 270 

Gly Lys Pro Val Gln Gly Val Ala Tyr Val Arg Phe Gly Leu Leu Asp 
275 280 285 

Glu Asp Gly Lys Lys Thr Phe Phe Arg Gly Leu Glu Ser Gln Thr Lys 
290 295 300 

Leu Val Asn Gly Gln Ser His Ile Ser Leu Ser Lys Ala Glu Phe Gln 
305 310 315 320 

Asp Ala Leu Glu Lys Leu Asn Met Gly Ile Thr Asp Leu Gln Gly Leu 
325 330 335 

Arg Leu Tyr Val Ala Ala Ala Ile Ile Glu Ser Pro Gly Gly Glu Met 
340 345 350 

Glu Glu Ala Glu Leu Thr Ser Trp Tyr Phe Val Ser Ser Pro Phe Ser 
355 360 365 

Leu Asp Leu Ser Lys Thr Lys Arg His Leu Val Pro Gly Ala Pro Phe 
370 375 380 

Leu Leu Gln Ala Leu Val Arg Glu Met Ser Gly Ser Pro Ala Ser Gly 
385 390 395 400 

Ile Pro Val Lys Val Ser Ala Thr Val Ser Ser Pro Gly Ser Val Pro 
405 410 415 

Glu Val Gln Asp Ile Gln Gln Asn Thr Asp Gly Ser Gly Gln Val Ser 
420 425 430 

Ile Pro Ile Ile Ile Pro Gln Thr Ile Ser Glu Leu Gln Leu Ser Val 
435 440 445 

Ser Ala Gly Ser Pro His Pro Ala Ile Ala Arg Leu Thr Val Ala Ala 
450 455 460 

Pro Pro Ser Gly Gly Pro Gly Phe Leu Ser Ile Glu Arg Pro Asp Ser 
465 470 475 480 

Arg Pro Pro Arg Val Gly Asp Thr Leu Asn Leu Asn Leu Arg Ala Val 
485 490 495 

Gly Ser Gly Ala Thr Phe Ser His Tyr Tyr Tyr Met Ile Leu Ser Arg 
500 505 510 

Gly Gln Ile Val Phe Met Asn Arg Glu Pro Lys Arg Thr Leu Thr Ser 
515 520 525 

Val Ser Val Phe Val Asp His His Leu Ala Pro Ser Phe Tyr Phe Val 
530 535 540 

Ala Phe Tyr Tyr His Gly Asp His Pro Val Ala Asn Ser Leu Arg Val 
545 550 555 560 

Asp Val Gln Ala Gly Ala Cys Glu Gly Lys Leu Glu Leu Ser Val Asp 
565 570 575 

Gly Ala Lys Gln Tyr Arg Asn Gly Glu Ser Val Lys Leu His Leu Glu 
580 585 590 

Thr Asp Ser Leu Ala Leu Val Ala Leu Gly Ala Leu Asp Thr Ala Leu 
595 600 605 

Tyr Ala Ala Gly Ser Lys Ser His Lys Pro Leu Asn Met Gly Lys Val 
610 615 620 

Phe Glu Ala Met Asn Ser Tyr Asp Leu Gly Cys Gly Pro Gly Gly Gly 
625 630 635 640 

Asp Ser Ala Leu Gln Val Phe Gln Ala Ala Gly Leu Ala Phe Ser Asp 
645 650 655 

Gly Asp Gln Trp Thr Leu Ser Arg Lys Arg Leu Ser Cys Pro Lys Glu 
660 665 670 

Lys Thr Thr Arg Lys Lys Arg Asn Val Asn Phe Gln Lys Ala Ile Asn 
675 680 685 

Glu Lys Leu Gly Gln Tyr Ala Ser Pro Thr Ala Lys Arg Cys Cys Gln 
690 695 700 

Asp Gly Val Thr Arg Leu Pro Met Met Arg Ser Cys Glu Gln Arg Ala 
705 710 715 720 

Ala Arg Val Gln Gln Pro Asp Cys Arg Glu Pro Phe Leu Ser Cys Cys 
725 730 735 

Gln Phe Ala Glu Ser Leu Arg Lys Lys Ser Arg Asp Lys Gly Gln Ala 
740 745 750 

Gly Leu Gln Arg Ala Leu Glu Ile Leu Gln Glu Glu Asp Leu Ile Asp 
755 760 765 






Glu Asp Asp Ile Pro Val Arg Ser Phe Phe Pro Glu Asn Trp Leu Trp 
770 775 780 

Arg Val Glu Thr Val Asp Arg Phe Gln Ile Leu Thr Leu Trp Leu Pro 
785 790 795 800 

Asp Ser Leu Thr Thr Trp Glu Ile His Gly Leu Ser Leu Ser Lys Thr 
805 810 815 

Lys Gly Leu Cys Val Ala Thr Pro Val Gln Leu Arg Val Phe Arg Glu 
820 825 830 

Phe His Leu His Leu Arg Leu Pro Met Ser Val Arg Arg Phe Glu Gln 
835 840 845 

Leu Glu Leu Arg Pro Val Leu Tyr Asn Tyr Leu Asp Lys Asn Leu Thr 
850 855 860 

Val Ser Val His Val Ser Pro Val Glu Gly Leu Cys Leu Ala Gly Gly 
865 870 875 880 

Gly Gly Leu Ala Gln Gln Val Leu Val Pro Ala Gly Ser Ala Arg Pro 
885 890 895 

Val Ala Phe Ser Val Val Pro Thr Ala Ala Ala Ala Val Ser Leu Lys 
900 905 910 

Val Val Ala Arg Gly Ser Phe Glu Phe Pro Val Gly Asp Ala Val Ser 
915 920 925 

Lys Val Leu Gln Ile Glu Lys Glu Gly Ala Ile His Arg Glu Glu Leu 
930 935 940 

Val Tyr Glu Leu Asn Pro Leu Asp His Arg Gly Arg Thr Leu Glu Ile 
945 950 955 960 

Pro Gly Asn Ser Asp Pro Asn Met Ile Pro Asp Gly Asp Phe Asn Ser 
965 970 975 

Tyr Val Arg Val Thr Ala Ser Asp Pro Leu Asp Thr Leu Gly Ser Glu 
980 985 990 

Gly Ala Leu Ser Pro Gly Gly Val Ala Ser Leu Leu Arg Leu Pro Arg 
995 1000 1005 

Gly Cys Gly Glu Gln Thr Met Ile Tyr Leu Ala Pro Thr Leu Ala 
1010 1015 1020 

Ala Ser Arg Tyr Leu Asp Lys Thr Glu Gln Trp Ser Thr Leu Pro 
1025 1030 1035 

Pro Glu Thr Lys Asp His Ala Val Asp Leu Ile Gln Lys Gly Tyr 
1040 1045 1050 

Met Arg Ile Gln Gln Phe Arg Lys Ala Asp Gly Ser Tyr Ala Ala 
1055 1060 1065 

Trp Leu Ser Arg Asp Ser Ser Thr Trp Leu Thr Ala Phe Val Leu 
1070 1075 1080 

Lys Val Leu Ser Leu Ala Gln Glu Gln Val Gly Gly Ser Pro Glu 
1085 1090 1095 

Lys Leu Gln Glu Thr Ser Asn Trp Leu Leu Ser Gln Gln Gln Ala 
1100 1105 1110 

Asp Gly Ser Phe Gln Asp Pro Cys Pro Val Leu Asp Arg Ser Met 
1115 1120 1125 

Gln Gly Gly Leu Val Gly Asn Asp Glu Thr Val Ala Leu Thr Ala 
1130 1135 1140 

Phe Val Thr Ile Ala Leu His His Gly Leu Ala Val Phe Gln Asp 
1145 1150 1155 

Glu Gly Ala Glu Pro Leu Lys Gln Arg Val Glu Ala Ser Ile Ser 
1160 1165 1170 

Lys Ala Asn Ser Phe Leu Gly Glu Lys Ala Ser Ala Gly Leu Leu 
1175 1180 1185 

Gly Ala His Ala Ala Ala Ile Thr Ala Tyr Ala Leu Ser Leu Thr 
1190 1195 1200 

Lys Ala Pro Val Asp Leu Leu Gly Val Ala His Asn Asn Leu Met 
1205 1210 1215 

Ala Met Ala Gln Glu Thr Gly Asp Asn Leu Tyr Trp Gly Ser Val 
1220 1225 1230 

Thr Gly Ser Gln Ser Asn Ala Val Ser Pro Thr Pro Ala Pro Arg 
1235 1240 1245 

Asn Pro Ser Asp Pro Met Pro Gln Ala Pro Ala Leu Trp Ile Glu 
1250 1255 1260 

Thr Thr Ala Tyr Ala Leu Leu His Leu Leu Leu His Glu Gly Lys 
1265 1270 1275 

Ala Glu Met Ala Asp Gln Ala Ser Ala Trp Leu Thr Arg Gln Gly 
1280 1285 1290 

Ser Phe Gln Gly Gly Phe Arg Ser Thr Gln Asp Thr Val Ile Ala 
1295 1300 1305 

Leu Asp Ala Leu Ser Ala Tyr Trp Ile Ala Ser His Thr Thr Glu 
1310 1315 1320 

Glu Arg Gly Leu Asn Val Thr Leu Ser Ser Thr Gly Arg Asn Gly 
1325 1330 1335 

Phe Lys Ser His Ala Leu Gln Leu Asn Asn Arg Gln Ile Arg Gly 
1340 1345 1350 

Leu Glu Glu Glu Leu Gln Phe Ser Leu Gly Ser Lys Ile Asn Val 
1355 1360 1365 

Val Gly Gly Asn Ser Lys Gly Thr Leu Lys Val Leu Arg Thr 
1370 1375 1380 



Tyr Asn Val Leu Asp Met Lys Asn Thr Thr Cys Gln Asp Leu Gln 
1385 1390 1395 

Ile Glu Val Thr Val Lys Gly His Val Glu Tyr Thr Met Glu Ala 
1400 1405 1410 

Asn Glu Asp Tyr Glu Asp Tyr Glu Tyr Asp Glu Leu Pro Ala Lys 
1415 1420 1425 

Asp Asp Pro Asp Ala Pro Leu Gln Pro Val Thr Pro Leu Gln Leu 
1430 1435 1440 

Phe Glu Gly Arg Arg Asn Arg Arg Arg Arg Glu Ala Pro Lys Val 
1445 1450 1455 

Val Glu Glu Gln Glu Ser Arg Val His Tyr Thr Val Cys Ile Trp 
1460 1465 1470 

Arg Asn Gly Lys Val Gly Leu Ser Gly Met Ala Ile Ala Asp Val 
1475 1480 1485 

Thr Leu Leu Ser Gly Phe His Ala Leu Arg Ala Asp Leu Glu Lys 
1490 1495 1500 

Leu Thr Ser Leu Ser Asp Arg Tyr Val Ser His Phe Glu Thr Glu 
1505 1510 1515 

Gly Pro His Val Leu Leu Tyr Phe Asp Ser Val Pro Thr Ser Arg 
1520 1525 1530 

Glu Cys Val Gly Phe Glu Ala Val Gln Glu Val Pro Val Gly Leu 
1535 1540 1545 

Val Gln Pro Ala Ser Ala Thr Leu Tyr Asp Tyr Tyr Asn Pro Glu 
1550 1555 1560 

Arg Arg Cys Ser Val Phe Tyr Gly Ala Pro Ser Lys Ser Arg Leu 
1565 1570 1575 

Leu Ala Thr Leu Cys Ser Ala Glu Val Cys Gln Cys Ala Glu Gly 
1580 1585 1590 

Lys Cys Pro Arg Gln Arg Arg Ala Leu Glu Arg Gly Leu Gln Asp 
1595 1600 1605 

Glu Asp Gly Tyr Arg Met Lys Phe Ala Cys Tyr Tyr Pro Arg Val 
1610 1615 1620 

Glu Tyr Gly Phe Gln Val Lys Val Leu Arg Glu Asp Ser Arg Ala 
1625 1630 1635 

Ala Phe Arg Leu Phe Glu Thr Lys Ile Thr Gln Val Leu His Phe 
1640 1645 1650 

Thr Lys Asp Val Lys Ala Ala Ala Asn Gln Met Arg Asn Phe Leu 
1655 1660 1665 

Val Arg Ala Ser Cys Arg Leu Arg Leu Glu Pro Gly Lys Glu Tyr 
1670 1675 1680 

Leu Ile Met Gly Leu Asp Gly Ala Thr Tyr Asp Leu Glu Gly His 
1685 1690 1695 

Pro Gln Tyr Leu Leu Asp Ser Asn Ser Trp Ile Glu Glu Met Pro 
1700 1705 1710 

Ser Glu Arg Leu Cys Arg Ser Thr Arg Gln Arg Ala Ala Cys Ala 
1715 1720 1725 

Gln Leu Asn Asp Phe Leu Gln Glu Tyr Gly Thr Gln Gly Cys Gln 
1730 1735 1740 

Val 

<210> 44 

<211> 1663 

<212> PRT 

<213> Homo sapiens 

<300> 

<308> Swiss-Prot/P01024 

<309> 1986-07-21 

<313> (1)..(1663) 

<400> 44 

Met Gly Pro Thr Ser Gly Pro Ser Leu Leu Leu Leu Leu Leu Thr His 
1 5 10 15 

Leu Pro Leu Ala Leu Gly Ser Pro Met Tyr Ser Ile Ile Thr Pro Asn 
20 25 30 

Ile Leu Arg Leu Glu Ser Glu Glu Thr Met Val Leu Glu Ala His Asp 
35 40 45 

Ala Gln Gly Asp Val Pro Val Thr Val Thr Val His Asp Phe Pro Gly 
50 55 60 

Lys Lys Leu Val Leu Ser Ser Glu Lys Thr Val Leu Thr Pro Ala Thr 
65 70 75 80 

Asn His Met Gly Asn Val Thr Phe Thr Ile Pro Ala Asn Arg Glu Phe 
85 90 95 

Lys Ser Glu Lys Gly Arg Asn Lys Phe Val Thr Val Gln Ala Thr Phe 
100 105 110 

Gly Thr Gln Val Val Glu Lys Val Val Leu Val Ser Leu Gln Ser Gly 
115 120 125 

Tyr Leu Phe Ile Gln Thr Asp Lys Thr Ile Tyr Thr Pro Gly Ser Thr 
130 135 140 

Val Leu Tyr Arg Ile Phe Thr Val Asn His Lys Leu Leu Pro Val Gly 
145 150 155 160 

Arg Thr Val Met Val Asn Ile Glu Asn Pro Glu Gly Ile Pro Val Lys 
165 170 175 

Gln Asp Ser Leu Ser Ser Gln Asn Gln Leu Gly Val Leu Pro Leu Ser 
180 185 190 

Trp Asp Ile Pro Glu Leu Val Asn Met Gly Gln Trp Lys Ile Arg Ala 
195 200 205 

Tyr Tyr Glu Asn Ser Pro Gln Gln Val Phe Ser Thr Glu Phe Glu Val 
210 215 220 

Lys Glu Tyr Val Leu Pro Ser Phe Glu Val Ile Val Glu Pro Thr Glu 
225 230 235 240 

Lys Phe Tyr Tyr Ile Tyr Asn Glu Lys Gly Leu Glu Val Thr Ile Thr 
245 250 255 

Ala Arg Phe Leu Tyr Gly Lys Lys Val Glu Gly Thr Ala Phe Val Ile 
260 265 270 

Phe Gly Ile Gln Asp Gly Glu Gln Arg Ile Ser Leu Pro Glu Ser Leu 
275 280 285 

Lys Arg Ile Pro Ile Glu Asp Gly Ser Gly Glu Val Val Leu Ser Arg 
290 295 300 

Lys Val Leu Leu Asp Gly Val Gln Asn Leu Arg Ala Glu Asp Leu Val 
305 310 315 320 

Gly Lys Ser Leu Tyr Val Ser Ala Thr Val Ile Leu His Ser Gly Ser 
325 330 335 

Asp Met Val Gln Ala Glu Arg Ser Gly Ile Pro Ile Val Thr Ser Pro 
340 345 350 

Tyr Gln Ile His Phe Thr Lys Thr Pro Lys Tyr Phe Lys Pro Gly Met 
355 360 365 

Pro Phe Asp Leu Met Val Phe Val Thr Asn Pro Asp Gly Ser Pro Ala 
370 375 380 

Tyr Arg Val Pro Val Ala Val Gln Gly Glu Asp Thr Val Gln Ser Leu 
385 390 395 400 

Thr Gln Gly Asp Gly Val Ala Lys Leu Ser Ile Asn Thr His Pro Ser 
405 410 415 

Gln Lys Pro Leu Ser Ile Thr Val Arg Thr Lys Lys Gln Glu Leu Ser 
420 425 430 

Glu Ala Glu Gln Ala Thr Arg Thr Met Gln Ala Leu Pro Tyr Ser Thr 
435 440 445 

Val Gly Asn Ser Asn Asn Tyr Leu His Leu Ser Val Leu Arg Thr Glu 
450 455 460 

Leu Arg Pro Gly Glu Thr Leu Asn Val Asn Phe Leu Leu Arg Met Asp 
465 470 475 480 

Arg Ala His Glu Ala Lys Ile Arg Tyr Tyr Thr Tyr Leu Ile Met Asn 
485 490 495 

Lys Gly Arg Leu Leu Lys Ala Gly Arg Gln Val Arg Glu Pro Gly Gln 
500 505 510 

Asp Leu Val Val Leu Pro Leu Ser Ile Thr Thr Asp Phe Ile Pro Ser 
515 520 525 

Phe Arg Leu Val Ala Tyr Tyr Thr Leu Ile Gly Ala Ser Gly Gln Arg 
530 535 540 

Glu Val Val Ala Asp Ser Val Trp Val Asp Val Lys Asp Ser Cys Val 
545 550 555 560 

Gly Ser Leu Val Val Lys Ser Gly Gln Ser Glu Asp Arg Gln Pro Val 
565 570 575 

Pro Gly Gln Gln Met Thr Leu Lys Ile Glu Gly Asp His Gly Ala Arg 
580 585 590 

Val Val Leu Val Ala Val Asp Lys Gly Val Phe Val Leu Asn Lys Lys 
595 600 605 

Asn Lys Leu Thr Gln Ser Lys Ile Trp Asp Val Val Glu Lys Ala Asp 
610 615 620 

Ile Gly Cys Thr Pro Gly Ser Gly Lys Asp Tyr Ala Gly Val Phe Ser 
625 630 635 640 

Asp Ala Gly Leu Thr Phe Thr Ser Ser Ser Gly Gln Gln Thr Ala Gln 
645 650 655 

Arg Ala Glu Leu Gln Cys Pro Gln Pro Ala Ala Arg Arg Arg Arg Ser 
660 665 670 

Val Gln Leu Thr Glu Lys Arg Met Asp Lys Val Gly Lys Tyr Pro Lys 
675 680 685 

Glu Leu Arg Lys Cys Cys Glu Asp Gly Met Arg Glu Asn Pro Met Arg 
690 695 700 

Phe Ser Cys Gln Arg Arg Thr Arg Phe Ile Ser Leu Gly Glu Ala Cys 
705 710 715 720 

Lys Lys Val Phe Leu Asp Cys Cys Asn Tyr Ile Thr Glu Leu Arg Arg 
725 730 735 

Gln His Ala Arg Ala Ser His Leu Gly Leu Ala Arg Ser Asn Leu Asp 
740 745 750 

Glu Asp Ile Ile Ala Glu Glu Asn Ile Val Ser Arg Ser Glu Phe Pro 
755 760 765 



Glu Ser Trp Leu Trp Asn Val Glu Asp Leu Lys Glu Pro Pro Lys Asn 
770 775 780 

Gly Ile Ser Thr Lys Leu Met Asn Ile Phe Leu Lys Asp Ser Ile Thr 
785 790 795 800 

Thr Trp Glu Ile Leu Ala Val Ser Met Ser Asp Lys Lys Gly Ile Cys 
805 810 815 

Val Ala Asp Pro Phe Glu Val Thr Val Met Gln Asp Phe Phe Ile Asp 
820 825 830 

Leu Arg Leu Pro Tyr Ser Val Val Arg Asn Glu Gln Val Glu Ile Arg 
835 840 845 

Ala Val Leu Tyr Asn Tyr Arg Gln Asn Gln Glu Leu Lys Val Arg Val 
850 855 860 

Glu Leu Leu His Asn Pro Ala Phe Cys Ser Leu Ala Thr Thr Lys Arg 
865 870 875 880 

Arg His Gln Gln Thr Val Thr Ile Pro Pro Lys Ser Ser Leu Ser Val 
885 890 895 

Pro Tyr Val Ile Val Pro Leu Lys Thr Gly Leu Gln Glu Val Glu Val 
900 905 910 

Lys Ala Ala Val Tyr His His Phe Ile Ser Asp Gly Val Arg Lys Ser 
915 920 925 

Leu Lys Val Val Pro Glu Gly Ile Arg Met Asn Lys Thr Val Ala Val 
930 935 940 

Arg Thr Leu Asp Pro Glu Arg Leu Gly Arg Glu Gly Val Gln Lys Glu 
945 950 955 960 

Asp Ile Pro Pro Ala Asp Leu Ser Asp Gln Val Pro Asp Thr Glu Ser 
965 970 975 

Glu Thr Arg Ile Leu Leu Gln Gly Thr Pro Val Ala Gln Met Thr Glu 
980 985 990 

Asp Ala Val Asp Ala Glu Arg Leu Lys His Leu Ile Val Thr Pro Ser 
995 1000 1005 

Gly Cys Gly Glu Gln Asn Met Ile Gly Met Thr Pro Thr Val Ile 
1010 1015 1020 

Ala Val His Tyr Leu Asp Glu Thr Glu Gln Trp Glu Lys Phe Gly 
1025 1030 1035 

Leu Glu Lys Arg Gln Gly Ala Leu Glu Leu Ile Lys Lys Gly Tyr 
1040 1045 1050 

Thr Gln Gln Leu Ala Phe Arg Gln Pro Ser Ser Ala Phe Ala Ala 
1055 1060 1065 

Phe Val Lys Arg Ala Pro Ser Thr Trp Leu Thr Ala Tyr Val Val 
1070 1075 1080 

Lys Val Phe Ser Leu Ala Val Asn Leu Ile Ala Ile Asp Ser Gln 
1085 1090 1095 

Val Leu Cys Gly Ala Val Lys Trp Leu Ile Leu Glu Lys Gln Lys 
1100 1105 1110 

Pro Asp Gly Val Phe Gln Glu Asp Ala Pro Val Ile His Gln Glu 
1115 1120 1125 

Met Ile Gly Gly Leu Arg Asn Asn Asn Glu Lys Asp Met Ala Leu 
1130 1135 1140 

Thr Ala Phe Val Leu Ile Ser Leu Gln Glu Ala Lys Asp Ile Cys 
1145 1150 1155 

Glu Glu Gln Val Asn Ser Leu Pro Gly Ser Ile Thr Lys Ala Gly 
1160 1165 1170 

Asp Phe Leu Glu Ala Asn Tyr Met Asn Leu Gln Arg Ser Tyr Thr 
1175 1180 1185 

Val Ala Ile Ala Gly Tyr Ala Leu Ala Gln Met Gly Arg Leu Lys 
1190 1195 1200 

Gly Pro Leu Leu Asn Lys Phe Leu Thr Thr Ala Lys Asp Lys Asn 
1205 1210 1215 

Arg Trp Glu Asp Pro Gly Lys Gln Leu Tyr Asn Val Glu Ala Thr 
1220 1225 1230 

Ser Tyr Ala Leu Leu Ala Leu Leu Gln Leu Lys Asp Phe Asp Phe 
1235 1240 1245 

Val Pro Pro Val Val Arg Trp Leu Asn Glu Gln Arg Tyr Tyr Gly 
1250 1255 1260 

Gly Gly Tyr Gly Ser Thr Gln Ala Thr Phe Met Val Phe Gln Ala 
1265 1270 1275 

Leu Ala Gln Tyr Gln Lys Asp Ala Pro Asp His Gln Glu Leu Asn 
1280 1285 1290 

Leu Asp Val Ser Leu Gln Leu Pro Ser Arg Ser Ser Lys Ile Thr 
1295 1300 1305 

His Arg Ile His Trp Glu Ser Ala Ser Leu Leu Arg Ser Glu Glu 
1310 1315 1320 

Thr Lys Glu Asn Glu Gly Phe Thr Val Thr Ala Glu Gly Lys Gly 
1325 1330 1335 

Gln Gly Thr Leu Ser Val Val Thr Met Tyr His Ala Lys Ala Lys 
1340 1345 1350 

Asp Gln Leu Thr Cys Asn Lys Phe Asp Leu Lys Val Thr Ile Lys 
1355 1360 1365 

Pro Ala Pro Glu Thr Glu Lys Arg Pro Gln Asp Ala Lys Asn Thr 
1370 1375 1380 

Met Ile Leu Glu Ile Cys Thr Arg Tyr Arg Gly Asp Gln Asp Ala 
1385 1390 1395 

Thr Met Ser Ile Leu Asp Ile Ser Met Met Thr Gly Phe Ala Pro 
1400 1405 1410 

Asp Thr Asp Asp Leu Lys Gln Leu Ala Asn Gly Val Asp Arg Tyr 
1415 1420 1425 

Ile Ser Lys Tyr Glu Leu Asp Lys Ala Phe Ser Asp Arg Asn Thr 
1430 1435 1440 

Leu Ile Ile Tyr Leu Asp Lys Val Ser His Ser Glu Asp Asp Cys 
1445 1450 1455 

Leu Ala Phe Lys Val His Gln Tyr Phe Asn Val Glu Leu Ile Gln 
1460 1465 1470 

Pro Gly Ala Val Lys Val Tyr Ala Tyr Tyr Asn Leu Glu Glu Ser 
1475 1480 1485 

Cys Thr Arg Phe Tyr His Pro Glu Lys Glu Asp Gly Lys Leu Asn 
1490 1495 1500 

Lys Leu Cys Arg Asp Glu Leu Cys Arg Cys Ala Glu Glu Asn Cys 
1505 1510 1515 

Phe Ile Gln Lys Ser Asp Asp Lys Val Thr Leu Glu Glu Arg Leu 
1520 1525 1530 

Asp Lys Ala Cys Glu Pro Gly Val Asp Tyr Val Tyr Lys Thr Arg 
1535 1540 1545 

Leu Val Lys Val Gln Leu Ser Asn Asp Phe Asp Glu Tyr Ile Met 
1550 1555 1560 

Ala Ile Glu Gln Thr Ile Lys Ser Gly Ser Asp Glu Val Gln Val 
1565 1570 1575 

Gly Gln Gln Arg Thr Phe Ile Ser Pro Ile Lys Cys Arg Glu Ala 
1580 1585 1590 

Leu Lys Leu Glu Glu Lys Lys His Tyr Leu Met Trp Gly Leu Ser 
1595 1600 1605 

Ser Asp Phe Trp Gly Glu Lys Pro Asn Leu Ser Tyr Ile Ile Gly 
1610 1615 1620 

Lys Asp Thr Trp Val Glu His Trp Pro Glu Glu Asp Glu Cys Gln 
1625 1630 1635 

Asp Glu Glu Asn Gln Lys Gln Cys Gln Asp Leu Gly Ala Phe Thr 
1640 1645 1650 

Glu Ser Met Val Val Phe Gly Cys Pro Asn 
1655 1660 

<210> 45 

<211> 93 

<212> PRT 

<213> Homo sapiens 

<300> 

<308> Swiss-Prot/Q9H299 

<309> 2003-02-28 

<313> (1)..(93) 

<400> 45 

Met Ser Gly Leu Arg Val Tyr Ser Thr Ser Val Thr Gly Ser Arg Glu 
1 5 10 15 

Ile Lys Ser Gln Gln Ser Glu Val Thr Arg Ile Leu Asp Gly Lys Arg 
20 25 30 

Ile Gln Tyr Gln Leu Val Asp Ile Ser Gln Asp Asn Ala Leu Arg Asp 
35 40 45 

Glu Met Arg Ala Leu Ala Gly Asn Pro Lys Ala Thr Pro Pro Gln Ile 
50 55 60 

Val Asn Gly Asp Gln Tyr Cys Gly Asp Tyr Glu Leu Phe Val Glu Ala 
65 70 75 80 

Val Glu Gln Asn Thr Leu Gln Glu Phe Leu Lys Leu Ala 
85 90 

<210> 46 

<211> 567 

<212> PRT 

<213> Homo sapiens 

<300> 

<308> Swiss-Prot/Q96RQ9 

<309> 2003-02-28 

<313> (1)..(567) 

<400> 46 

Met Ala Pro Leu Ala Leu His Leu Leu Val Leu Val Pro Ile Leu Leu 
1 5 10 15 

Ser Leu Val Ala Ser Gln Asp Trp Lys Ala Glu Arg Ser Gln Asp Pro 
20 25 30 

Phe Glu Lys Cys Met Gln Asp Pro Asp Tyr Glu Gln Leu Leu Lys Val 
35 40 45 

Val Thr Trp Gly Leu Asn Arg Thr Leu Lys Pro Gln Arg Val Ile Val 
50 55 60 

Val Gly Ala Gly Val Ala Gly Leu Val Ala Ala Lys Val Leu Ser Asp 
65 70 75 80 

Ala Gly His Lys Val Thr Ile Leu Glu Ala Asp Asn Arg Ile Gly Gly 
85 90 95 

Arg Ile Phe Thr Tyr Arg Asp Gln Asn Thr Gly Trp Ile Gly Glu Leu 
100 105 110 

Gly Ala Met Arg Met Pro Ser Ser His Arg Ile Leu His Lys Leu Cys 
115 120 125 

Gln Gly Leu Gly Leu Asn Leu Thr Lys Phe Thr Gln Tyr Asp Lys Asn 
130 135 140 

Thr Trp Thr Glu Val His Glu Val Lys Leu Arg Asn Tyr Val Val Glu 
145 150 155 160 

Lys Val Pro Glu Lys Leu Gly Tyr Ala Leu Arg Pro Gln Glu Lys Gly 
165 170 175 

His Ser Pro Glu Asp Ile Tyr Gln Met Ala Leu Asn Gln Ala Leu Lys 
180 185 190 

Asp Leu Lys Ala Leu Gly Cys Arg Lys Ala Met Lys Lys Phe Glu Arg 
195 200 205 

His Thr Leu Leu Glu Tyr Leu Leu Gly Glu Gly Asn Leu Ser Arg Pro 
210 215 220 

Ala Val Gln Leu Leu Gly Asp Val Met Ser Glu Asp Gly Phe Phe Tyr 
225 230 235 240 

Leu Ser Phe Ala Glu Ala Leu Arg Ala His Ser Cys Leu Ser Asp Arg 
245 250 255 

Leu Gln Tyr Ser Arg Ile Val Gly Gly Trp Asp Leu Leu Pro Arg Ala 
260 265 270 

Leu Leu Ser Ser Leu Ser Gly Leu Val Leu Leu Asn Ala Pro Val Val 
275 280 285 

Ala Met Thr Gln Gly Pro His Asp Val His Val Gln Ile Glu Thr Ser 
290 295 300 

Pro Pro Ala Arg Asn Leu Lys Val Leu Lys Ala Asp Val Val Leu Leu 
305 310 315 320 

Thr Ala Ser Gly Pro Ala Val Lys Arg Ile Thr Phe Ser Pro Pro Leu 
325 330 335 

Pro Arg His Met Gln Glu Ala Leu Arg Arg Leu His Tyr Val Pro Ala 
340 345 350 

Thr Lys Val Phe Leu Ser Phe Arg Arg Pro Phe Trp Arg Glu Glu His 
355 360 365 

Ile Glu Gly Gly His Ser Asn Thr Asp Arg Pro Ser Arg Met Ile Phe 
370 375 380 

Tyr Pro Pro Pro Arg Glu Gly Ala Leu Leu Leu Ala Ser Tyr Thr Trp 
385 390 395 400 

Ser Asp Ala Ala Ala Ala Phe Ala Gly Leu Ser Arg Glu Glu Ala Leu 
405 410 415 

Arg Leu Ala Leu Asp Asp Val Ala Ala Leu His Gly Pro Val Val Arg 
420 425 430 

Gln Leu Trp Asp Gly Thr Gly Val Val Lys Arg Trp Ala Glu Asp Gln 
435 440 445 

His Ser Gln Gly Gly Phe Val Val Gln Pro Pro Ala Leu Trp Gln Thr 
450 455 460 

Glu Lys Asp Asp Trp Thr Val Pro Tyr Gly Arg Ile Tyr Phe Ala Gly 
465 470 475 480 

Glu His Thr Ala Tyr Pro His Gly Trp Val Glu Thr Ala Val Lys Ser 
485 490 495 

Ala Leu Arg Ala Ala Ile Lys Ile Asn Ser Arg Lys Gly Pro Ala Ser 
500 505 510 

Asp Thr Ala Ser Pro Glu Gly His Ala Ser Asp Met Glu Gly Gln Gly 
515 520 525 

His Val His Gly Val Ala Ser Ser Pro Ser His Asp Leu Ala Lys Glu 
530 535 540 

Glu Gly Ser His Pro Pro Val Gln Gly Gln Leu Ser Leu Gln Asn Thr 
545 550 555 560 

Thr His Thr Arg Thr Ser His 
565 



<210> 47 

<211> 462 

<212> PRT 

<213> Homo sapiens 

<300> 

<308> Swiss-Prot/P02790 

<309> 1986-07-21 

<313> (1)..(462) 

<400> 47 

Met Ala Arg Val Leu Gly Ala 
1 5 10 15 

Cys Trp Ser Leu Ala Ile Ala Thr Pro Leu Pro Pro Thr Ser Ala His 
20 25 30 

Gly Asn Val Ala Glu Gly Glu Thr Lys Pro Asp Pro Asp Val Thr Glu 
35 40 45 

Arg Cys Ser Asp Gly Trp Ser Phe Asp Ala Thr Thr Leu Asp Asp Asn 
50 55 60 

Gly Thr Met Leu Phe Phe Lys Gly Glu Phe Val Trp Lys Ser His Lys 
65 70 75 80 

Trp Asp Arg Glu Leu Ile Ser Glu Arg Trp Lys Asn Phe Pro Ser Pro 
85 90 95 

Val Asp Ala Ala Phe Arg Gln Gly His Asn Ser Val Phe Leu Ile Lys 
100 105 110 

Gly Asp Lys Val Trp Val Tyr Pro Pro Glu Lys Lys Glu Lys Gly Tyr 
115 120 125 

Pro Lys Leu Leu Gln Asp Glu Phe Pro Gly Ile Pro Ser Pro Leu Asp 
130 135 140 

Ala Ala Val Glu Cys His Arg Gly Glu Cys Gln Ala Glu Gly Val Leu 
145 150 155 160 

Phe Phe Gln Gly Asp Arg Glu Trp Phe Trp Asp Leu Ala Thr Gly Thr 
165 170 175 

Met Lys Glu Arg Ser Trp Pro Ala Val Gly Asn Cys Ser Ser Ala Leu 
180 185 190 

Arg Trp Leu Gly Arg Tyr Tyr Cys Phe Gln Gly Asn Gln Phe Leu Arg 
195 200 205 

Phe Asp Pro Val Arg Gly Glu Val Pro Pro Arg Tyr Pro Arg Asp Val 
210 215 220 

Arg Asp Tyr Phe Met Pro Cys Pro Gly Arg Gly His Gly His Arg Asn 
225 230 235 240 

Gly Thr Gly His Gly Asn Ser Thr His His Gly Pro Glu Tyr Met Arg 
245 250 255 

Cys Ser Pro His Leu Val Leu Ser Ala Leu Thr Ser Asp Asn His Gly 
260 265 270 

Ala Thr Tyr Ala Phe Ser Gly Thr His Tyr Trp Arg Leu Asp Thr Ser 
275 280 285 

Arg Asp Gly Trp His Ser Trp Pro Ile Ala His Gln Trp Pro Gln Gly 
290 295 300 



Pro Ser Ala Val Asp Ala Ala Phe Ser Trp Glu Glu Lys Leu Tyr Leu 
305 310 315 320 



Val Gln Gly Thr Gln Val Tyr Val Phe Leu Thr Lys Gly Gly Tyr Thr 
325 330 335 

Leu Val Ser Gly Tyr Pro Lys Arg Leu Glu Lys Glu Val Gly Thr Pro 
340 345 350 

His Gly Ile Ile Leu Asp Ser Val Asp Ala Ala Phe Ile Cys Pro Gly 
355 360 365 

Ser Ser Arg Leu His Ile Met Ala Gly Arg Arg Leu Trp Trp Leu Asp 
370 375 380 

Leu Lys Ser Gly Ala Gln Ala Thr Trp Thr Glu Leu Pro Trp Pro His 
385 390 395 400 

Glu Lys Val Asp Gly Ala Leu Cys Met Glu Lys Ser Leu Gly Pro Asn 
405 410 415 

Ser Cys Ser Ala Asn Gly Pro Gly Leu Tyr Leu Ile His Gly Pro Asn 
420 425 430 

Leu Tyr Cys Tyr Ser Asp Val Glu Lys Leu Asn Ala Ala Lys Ala Leu 
435 440 445 

Pro Gln Pro Gln Asn Val Thr Ser Leu Leu Gly Cys Thr His 
450 455 460 



<210> 48 

<211> 369 

<212> PRT 

<213> Homo sapiens 



<300> 

<308> Swiss-Prot/P50502 

<309> 1996-10-02 

<313> (1)..(369) 

<400> 48 

Met Asp Pro Arg Lys Val Asn Glu Leu Arg Ala Phe Val Lys Met Cys 
1 5 10 15 

Lys Gln Asp Pro Ser Val Leu His Thr Glu Glu Met Arg Phe Leu Arg 
20 25 30 

Glu Trp Val Glu Ser Met Gly Gly Lys Val Pro Pro Ala Thr Gln Lys 
35 40 45 

Ala Lys Ser Glu Glu Asn Thr Lys Glu Glu Lys Pro Asp Ser Lys Lys 
50 55 60 

Val Glu Glu Asp Leu Lys Ala Asp Glu Pro Ser Ser Glu Glu Ser Asp 
65 70 75 80 

Leu Glu Ile Asp Lys Glu Gly Val Ile Glu Pro Asp Thr Asp Ala Pro 
85 90 95 

Gln Glu Met Gly Asp Glu Asn Ala Glu Ile Thr Glu Glu Met Met Asp 
100 105 110 

Gln Ala Asn Asp Lys Lys Val Ala Ala Ile Glu Ala Leu Asn Asp Gly 
115 120 125 

Glu Leu Gln Lys Ala Ile Asp Leu Phe Thr Asp Ala Ile Lys Leu Asn 
130 135 140 

Pro Arg Leu Ala Ile Leu Tyr Ala Lys Arg Ala Ser Val Phe Val Lys 
145 150 155 160 

Leu Gln Lys Pro Asn Ala Ala Ile Arg Asp Cys Asp Arg Ala Ile Glu 
165 170 175 

Ile Asn Pro Asp Ser Ala Gln Pro Tyr Lys Trp Arg Gly Lys Ala His 
180 185 190 



Arg Leu Leu Gly His Trp Glu Glu Ala Ala His Asp Leu Ala Leu Ala 
195 200 205 



Cys Lys Leu Asp Tyr Asp Glu Asp Ala Ser Ala Met Leu Lys Glu Val 
210 215 220 



Gln Pro Arg Ala Gln Lys Ile Ala Glu His Arg Arg Lys Tyr Glu Arg 
225 230 235 240 

Lys Arg Glu Glu Arg Glu Ile Lys Glu Arg Ile Glu Arg Val Lys Lys 
245 250 255 

Ala Arg Glu Glu His Glu Arg Ala Gln Arg Glu Glu Glu Ala Arg Arg 
260 265 270 

Gln Ser Gly Ala Gln Tyr Gly Ser Phe Pro Gly Gly Phe Pro Gly Gly 
275 280 285 

Met Pro Gly Asn Phe Pro Gly Gly Met Pro Gly Met Gly Gly Gly Met 
290 295 300 

Pro Gly Met Ala Gly Met Pro Gly Leu Asn Glu Ile Leu Ser Asp Pro 
305 310 315 320 

Glu Val Leu Ala Ala Met Gln Asp Pro Glu Val Met Val Ala Phe Gln 
325 330 335 

Asp Val Ala Gln Asn Pro Ala Asn Met Ser Lys Tyr Gln Ser Asn Pro 
340 345 350 

Lys Val Met Asn Leu Ile Ser Lys Leu Ser Ala Lys Phe Gly Gly Gln 
355 360 365 



Ala 



<210> 49 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 49 

Met Gln Leu Met His Ala Asn Ala Gln 
1 5 



<210> 50 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Leu Thr Leu Asp Ser Asn Thr Lys Tyr 
1 5 



<210> 51 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 51 

Phe Val Ile Asp Lys Ser Gly Ser Met 
1 5 



<210> 52 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 52 

Tyr Leu Leu Asp Ser Asn Ser Trp Ile 
1 5 


<210> 53 

<211> 9 

<212> PRT 

<213> Homo sapiens 






<400> 53 

Tyr Glu Leu Asp Lys Ala Phe Ser Asp 
1 5 



<210> 54 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 54 

Ile Lys Ser Gln Gln Ser Glu Val Thr 
1 5 



<210> 55 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 55 

Val Gln Ile Glu Thr Ser Pro Pro Ala 
1 5 



<210> 56 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 56 

Ile Ile Leu Asp Ser Val Asp Ala Ala 
1 5 



<210> 57 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 57 



Ile Glu Pro Asp Thr Asp Ala Pro Gln 
1 5 